Results 1 - 10
of
24
Shot change detection via local keypoint matching
- IEEE Trans. Multimedia
, 2008
"... Abstract—Shot change detection is an essential step in video content analysis. However, automatic shot change detection often suffers from high false detection rates due to camera or object movements. To solve this problem, we propose an approach based on local keypoint matching of video frames. Thi ..."
Abstract
-
Cited by 10 (2 self)
- Add to MetaCart
(Show Context)
Abstract—Shot change detection is an essential step in video content analysis. However, automatic shot change detection often suffers from high false detection rates due to camera or object movements. To solve this problem, we propose an approach based on local keypoint matching of video frames. This approach aims to detect both abrupt and gradual transitions between shots without modeling different kinds of transitions. Our experiment results show that the proposed algorithm is effective for most kinds of shot changes. Index Terms—Invariant local feature, matching, recognition, shot change detection. I.
Intrinsic dimensionality predicts the saliency of natural dynamic scenes
- IEEE Trans. on Pattern Analysis and Machine Intelligence
, 2012
"... Abstract — Since visual attention-based computer vision appli-cations have gained popularity, ever more complex, biologically-inspired models seem to be needed to predict salient locations (or interest points) in naturalistic scenes. In this paper, we explore how far one can go in predicting eye mov ..."
Abstract
-
Cited by 7 (2 self)
- Add to MetaCart
(Show Context)
Abstract — Since visual attention-based computer vision appli-cations have gained popularity, ever more complex, biologically-inspired models seem to be needed to predict salient locations (or interest points) in naturalistic scenes. In this paper, we explore how far one can go in predicting eye movements by using only basic signal processing, such as image representations derived from efficient coding principles, and machine learning. To this end, we gradually increase the complexity of a model from simple single-scale saliency maps computed on grayscale videos to spatio-temporal multiscale and multispectral representations. Using a large collection of eye movements on high-resolution videos, supervised learning techniques fine-tune the free param-eters whose addition is inevitable with increasing complexity. The proposed model, although very simple, demonstrates significant improvement in predicting salient locations in naturalistic videos over four selected baseline models and two distinct data labelling scenarios. Index Terms — Computational models of vision, video analysis, computer vision, spatio-temporal saliency, eye movement predic-tion, intrinsic dimension, visual attention, interest point detection. I.
Vlogging: A survey of videoblogging technology on the web
- ACM Computing Surveys
, 2010
"... In recent years, blogging has become an exploding passion among Internet communities. By combining the grassroots blogging with the richness of expression available in video, videoblogs (vlogs for short) will be a powerful new media adjunct to our existing televised news sources. Vlogs have gained m ..."
Abstract
-
Cited by 4 (0 self)
- Add to MetaCart
In recent years, blogging has become an exploding passion among Internet communities. By combining the grassroots blogging with the richness of expression available in video, videoblogs (vlogs for short) will be a powerful new media adjunct to our existing televised news sources. Vlogs have gained much attention worldwide, especially with Google’s acquisition of YouTube. This article presents a comprehensive survey of videoblogging (vlogging for short) as a new technological trend. We first summarize the technological chal-lenges for vlogging as four key issues that need to be answered. Along with their respective possibilities, we give a review of the currently available techniques and tools supporting vlogging, and envision emerg-ing technological directions for future vlogging. Several multimedia technologies are introduced to empower vlogging technology with better scalability, interactivity, searchability, and accessability, and to potentially reduce the legal, economic, and moral risks of vlogging applications. We also make an in-depth investiga-tion of various vlog mining topics from a research perspective and present several incentive applications such as user-targeted video advertising and collective intelligence gaming. We believe that vlogging and its
Simultaneous detection of abrupt cuts and dissolves in videos . . .
- PATTERN RECOGNITION LETTERS
, 2009
"... ..."
Fuzzy Color Histogram-based Video Segmentation
- COMPUTER VISION AND IMAGE UNDERSTANDING
, 2010
"... We present a fuzzy color histogram-based shot-boundary detection algorithm specialized for content based copy detection applications. The proposed method aims to detect both cuts and gradual transitions (fade, dissolve) effectively in videos where heavy transformations (such as cam-cording, insertio ..."
Abstract
-
Cited by 3 (1 self)
- Add to MetaCart
We present a fuzzy color histogram-based shot-boundary detection algorithm specialized for content based copy detection applications. The proposed method aims to detect both cuts and gradual transitions (fade, dissolve) effectively in videos where heavy transformations (such as cam-cording, insertions of patterns, strong re-encoding) occur. Along with the color histogram generated with the fuzzy linking method on L*a*b* color space, the system extracts a mask for still regions and the window of picture-in-picture transformation for each detected shot, which will be useful in a content-based copy detection system. Experimental results show that our method effectively detects shot boundaries and reduces false alarms as compared to the state-of-the-art shot-boundary detection algorithms.
Context-sensitive queries for image retrieval in digital libraries
- J INTELL INF SYST
, 2007
"... ..."
GAZE SHIFTS AS DYNAMICAL RANDOM SAMPLING
"... We discuss how gaze behavior of an observer can be simulated as a Monte Carlo sampling of a distribution obtained from the saliency map of the observed image. To such end we propose the Levy Hybrid Monte Carlo algorithm, a dynamic Monte Carlo method in which the walk on the distribution landscape is ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
(Show Context)
We discuss how gaze behavior of an observer can be simulated as a Monte Carlo sampling of a distribution obtained from the saliency map of the observed image. To such end we propose the Levy Hybrid Monte Carlo algorithm, a dynamic Monte Carlo method in which the walk on the distribution landscape is modelled through Levy flights. Some preliminary results are presented comparing with data gathered by eye-tracking human observers involved in an emotion recognition task from facial expression displays.
Differential Edit Distance: A metric for scene segmentation evaluation
"... Abstract—In this work a novel approach to evaluating video temporal decomposition algorithms is presented. The evaluation measures typically used to this end are non-linear combinations of Precision-Recall or Coverage-Overflow, which are not metrics and additionally possess undesirable properties, s ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
(Show Context)
Abstract—In this work a novel approach to evaluating video temporal decomposition algorithms is presented. The evaluation measures typically used to this end are non-linear combinations of Precision-Recall or Coverage-Overflow, which are not metrics and additionally possess undesirable properties, such as nonsymmetricity. To alleviate these drawbacks we introduce a novel uni-dimensional measure that is proven to be metric and satisfies a number of qualitative prerequisites that previous measures do not. This measure is named Differential Edit Distance (DED), since it can be seen as a variation of the well-known edit distance. After defining DED, we further introduce an algorithm that computes it in less than cubic time. Finally, DED is extensively compared with state of the art measures, namely the harmonic means (F-Score) of Precision-Recall and Coverage-Overflow. The experiments include comparisons of qualitative properties, the time required for optimizing the parameters of scene segmentation algorithms with the help of these measures, and a user study gauging the agreement of these measures with the users ’ assessment of the segmentation results. The results confirm that the proposed measure is a uni-dimensional metric that is effective in evaluating scene segmentation techniques and in helping to optimize their parameters. I.
Video Classification and Shot Detection for Video Retrieval Applications
"... Appropriate organization of video databases is essential for pertinent indexing and retrieval of visual information. This paper proposes a new feature called Block Intensity Comparison Code (BICC) for video classification and an unsupervised shot change detection algorithm to detect the shot changes ..."
Abstract
- Add to MetaCart
Appropriate organization of video databases is essential for pertinent indexing and retrieval of visual information. This paper proposes a new feature called Block Intensity Comparison Code (BICC) for video classification and an unsupervised shot change detection algorithm to detect the shot changes in a video stream using autoassociative neural network (AANN) which makes retrieval problems much simpler. BICC represents the average block intensity difference between blocks of a frame. A novel AANN misclustering rate (AMR) algorithm is used to detect the shot transitions. The experiments demonstrate the effectiveness of the proposed methods.
Peking University and
"... In recent years, blogging has become an exploding passion among Internet communities. By combining the grassroots blogging with the richness of expression available in video, videoblogs (vlogs for short) will be a powerful new media adjunct to our existing televised news sources. Vlogs have gained m ..."
Abstract
- Add to MetaCart
In recent years, blogging has become an exploding passion among Internet communities. By combining the grassroots blogging with the richness of expression available in video, videoblogs (vlogs for short) will be a powerful new media adjunct to our existing televised news sources. Vlogs have gained much attention worldwide, especially with Google’s acquisition of YouTube. This article presents a comprehensive survey of videoblogging (vlogging for short) as a new technological trend. We first summarize the technological challenges for vlogging as four key issues that need to be answered. Along with their respective possibilities, we give a review of the currently available techniques and tools supporting vlogging, and envision emerging technological directions for future vlogging. Several multimedia technologies are introduced to empower vlogging technology with better scalability, interactivity, searchability, and accessability, and to potentially reduce the legal, economic, and moral risks of vlogging applications. We also make an in-depth investigation of various vlog mining topics from a research perspective and present several incentive applications such as user-targeted video advertising and collective intelligence gaming. We believe that vlogging and its