Results 1 - 10
of
19
Pedestrian Detection: An Evaluation of the State of the Art
- SUBMISSION TO IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 1
"... Pedestrian detection is a key problem in computer vision, with several applications that have the potential to positively impact quality of life. In recent years, the number of approaches to detecting pedestrians in monocular images has grown steadily. However, multiple datasets and widely varying e ..."
Abstract
-
Cited by 174 (10 self)
- Add to MetaCart
Pedestrian detection is a key problem in computer vision, with several applications that have the potential to positively impact quality of life. In recent years, the number of approaches to detecting pedestrians in monocular images has grown steadily. However, multiple datasets and widely varying evaluation protocols are used, making direct comparisons difficult. To address these shortcomings, we perform an extensive evaluation of the state of the art in a unified framework. We make three primary contributions: (1) we put together a large, well-annotated and realistic monocular pedestrian detection dataset and study the statistics of the size, position and occlusion patterns of pedestrians in urban scenes, (2) we propose a refined per-frame evaluation methodology that allows us to carry out probing and informative comparisons, including measuring performance in relation to scale and occlusion, and (3) we evaluate the performance of sixteen pre-trained state-of-the-art detectors across six datasets. Our study allows us to assess the state of the art and provides a framework for gauging future efforts. Our experiments show that despite significant progress, performance still has much room for improvement. In particular, detection is disappointing at low resolutions and for partially occluded pedestrians.
A MULTI-LEVEL MIXTURE-OF-EXPERTS FRAMEWORK FOR PEDESTRIAN CLASSIFICATION
, 2011
"... Notwithstanding many years of progress, pedestrian recognition is still a difficult but important problem. We present a novel multi-level Mixture-of-Experts approach to combine information from multiple features and cues with the objective of improved pedestrian classification. On pose-level, shape ..."
Abstract
-
Cited by 16 (3 self)
- Add to MetaCart
Notwithstanding many years of progress, pedestrian recognition is still a difficult but important problem. We present a novel multi-level Mixture-of-Experts approach to combine information from multiple features and cues with the objective of improved pedestrian classification. On pose-level, shape cues based on Chamfer shape matching provide sample-dependent priors for a certain pedestrian view. On modality-level, we represent each data sample in terms of image intensity, (dense) depth and (dense) flow. On feature-level, we consider histograms of oriented gradients (HOG) and local binary patterns (LBP). Multilayer perceptrons (MLP) and linear support vector machines (linSVM) are used as expert classifiers. Experiments are performed on a unique real-world multimodality dataset captured from a moving vehicle in urban traffic. This dataset has been made public for research purposes. Our results show a significant performance boost of up to a factor of 42 in reduction of false positives at constant detection rates of our approach compared to a baseline intensity-only HOG/linSVM approach.
Multi-Class Open Set Recognition Using Probability of Inclusion
"... Abstract. The perceived success of recent visual recognition approaches has largely been derived from their performance on classification tasks, where all possible classes are known at training time. But what about open set problems, where unknown classes appear at test time? Intuitively, if we coul ..."
Abstract
-
Cited by 4 (3 self)
- Add to MetaCart
(Show Context)
Abstract. The perceived success of recent visual recognition approaches has largely been derived from their performance on classification tasks, where all possible classes are known at training time. But what about open set problems, where unknown classes appear at test time? Intuitively, if we could accurately model just the positive data for any known class without overfitting, we could reject the large set of unknown classes even under an assumption of incomplete class knowledge. In this paper, we formulate the problem as one of modeling positive training data at the decision boundary, where we can invoke the statistical extreme value theory. A new algorithm called the PI-SVM is introduced for estimating the unnormalized posterior probability of class inclusion. 1
PROPRE: PROjection and PREdiction for multimodal correlations learning. An
"... application to pedestrians visual data discrimination. ..."
Abstract
-
Cited by 3 (3 self)
- Add to MetaCart
(Show Context)
application to pedestrians visual data discrimination.
Person Appearance Modeling and Orientation Estimation using Spherical Harmonics
"... Abstract — We present a novel approach for the joint estimation of a person’s overall body orientation, 3D shape and texture, from overlapping cameras. Overall body orientation (i.e. rotation around torso major axis) is estimated by minimizing the difference between a learned texture model in a cano ..."
Abstract
-
Cited by 2 (0 self)
- Add to MetaCart
(Show Context)
Abstract — We present a novel approach for the joint estimation of a person’s overall body orientation, 3D shape and texture, from overlapping cameras. Overall body orientation (i.e. rotation around torso major axis) is estimated by minimizing the difference between a learned texture model in a canonical orientation and a texture sampled using the current 3D shape estimate (i.e. torso and head). The estimated body orientation subsequently allows to update the 3D shape estimate, taking into account the new 3D shape measurement obtained by volume carving. Our main contribution is a method for estimating a person’s relative body orientation while simultaneously generating a basic Spherical Harmonics based model of the person’s shape and texture. Experiments show that the proposed method outperforms two state-of-the-art orientation estimation methods: one combining a fixed 3D shape model with a generate-and-test texture matching approach and one using a classifier based approach. I.
ESTIMATION OF HUMAN ORIENTATION BASED ON SILHOUETTES AND MACHINE LEARNING PRINCIPLES
"... Abstract: Estimating the orientation of the observed person is a crucial task for home entertainment, man-machine in-teraction, intelligent vehicles, etc. This is possible but complex with a single camera because it only provides one side view. To decrease the sensitivity to color and texture, we us ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
(Show Context)
Abstract: Estimating the orientation of the observed person is a crucial task for home entertainment, man-machine in-teraction, intelligent vehicles, etc. This is possible but complex with a single camera because it only provides one side view. To decrease the sensitivity to color and texture, we use the silhouette to infer the orientation. Under these conditions, we show that the only intrinsic limitation is to confuse the orientation θ with the sup-plementary angle (that is 180°−θ), and that the shape descriptor must distinguish between mirrored images. In this paper, the orientation estimation is expressed and solved in the terms of a regression problem and super-vised learning. In our experiments, we have tested and compared 18 shape descriptors; the best one achieves a mean error of 5.24°. However, because of the intrinsic limitation mentioned above, the range of orientations is limited to 180°. Our method is easy to implement and outperforms existing techniques. 1
Single-Pedestrian Detection Aided by 2-Pedestrian Detection
- IEEE TRANSACTIONS PATTERN ANALYSIS AND MACHINE INTELLIGENCE
"... In this paper, we address the challenging problem of detecting pedestrians who appear in groups. A new approach is proposed for single-pedestrian detection aided by 2-pedestrian detection. A mixture model of 2-pedestrian detectors is designed to capture the unique visual cues which are formed by nea ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
In this paper, we address the challenging problem of detecting pedestrians who appear in groups. A new approach is proposed for single-pedestrian detection aided by 2-pedestrian detection. A mixture model of 2-pedestrian detectors is designed to capture the unique visual cues which are formed by nearby pedestrians but cannot be captured by single-pedestrian detectors. A probabilistic framework is proposed to model the relationship between the configurations estimated by single- and 2-pedestrian detectors, and to refine the single-pedestrian detection result using 2-pedestrian detection. The 2-pedestrian detector can integrate with any single-pedestrian detector. 25 state-of-the-art single-pedestrian detection approaches are combined with the 2-pedestrian detector on three widely used public datasets: Caltech, TUD-Brussels, and ETH. Experimental results show that our framework improves all these approaches. The average improvement is 9 % on the Caltech-Test dataset, 11 % on the TUD-Brussels dataset and 17 % on the ETH dataset in terms of average miss rate. The lowest average miss rate is reduced from 37%
Context-Based Pedestrian Path Prediction
"... Abstract. We present a novel Dynamic Bayesian Network for pedestrian path prediction in the intelligent vehicle domain. The model incorporates the pedes-trian situational awareness, situation criticality and spatial layout of the environ-ment as latent states on top of a Switching Linear Dynamical S ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
(Show Context)
Abstract. We present a novel Dynamic Bayesian Network for pedestrian path prediction in the intelligent vehicle domain. The model incorporates the pedes-trian situational awareness, situation criticality and spatial layout of the environ-ment as latent states on top of a Switching Linear Dynamical System (SLDS) to anticipate changes in the pedestrian dynamics. Using computer vision, situational awareness is assessed by the pedestrian head orientation, situation criticality by the distance between vehicle and pedestrian at the expected point of closest ap-proach, and spatial layout by the distance of the pedestrian to the curbside. Our particular scenario is that of a crossing pedestrian, who might stop or continue walking at the curb. In experiments using stereo vision data obtained from a ve-hicle, we demonstrate that the proposed approach results in more accurate path prediction than only SLDS, at the relevant short time horizon (1 s), and slightly outperforms a computationally more demanding state-of-the-art method.
A Probabilistic Framework for Joint Pedestrian Head and Body Orientation Estimation
"... Abstract-We present a probabilistic framework for the joint estimation of pedestrian head and body orientation from a mobile stereo vision platform. For both head and body parts, we convert the responses of a set of orientation-specific detectors into a (continuous) probability density function. Th ..."
Abstract
- Add to MetaCart
(Show Context)
Abstract-We present a probabilistic framework for the joint estimation of pedestrian head and body orientation from a mobile stereo vision platform. For both head and body parts, we convert the responses of a set of orientation-specific detectors into a (continuous) probability density function. The parts are localized by means of a pictorial structure approach, which balances part-based detector responses with spatial constraints. Head and body orientations are estimated jointly to account for anatomical constraints. The joint single-frame orientation estimates are integrated over time by particle filtering. The experiments involved data from a vehicle-mounted stereo vision camera in a realistic traffic setting; 65 pedestrian tracks were supplied by a state-of-the-art pedestrian tracker. We show that the proposed joint probabilistic orientation estimation framework reduces the mean absolute head and body orientation error up to 15 • compared with simpler methods. This results in a mean absolute head/body orientation error of about 21 • /19 • , which remains fairly constant up to a distance of 25 m. Our system currently runs in near real time (8-9 Hz).
Head Detection and Orientation Estimation for Pedestrian Safety
"... Abstract-Head detection and orientation estimation are a vital component in the intention recognition of pedestrians. In this paper we propose a novel framework to detect highly occluded pedestrians and estimate their head orientation. Detection is performed for pedestrian's heads only. For th ..."
Abstract
- Add to MetaCart
(Show Context)
Abstract-Head detection and orientation estimation are a vital component in the intention recognition of pedestrians. In this paper we propose a novel framework to detect highly occluded pedestrians and estimate their head orientation. Detection is performed for pedestrian's heads only. For this we employ a part-based classifier with HOG/SVM combinations. Head orientations are estimated using discrete orientation classifiers and LBP features. Results are improved by leveraging orientation estimation for head and torso as well as motion information. The orientation estimation is integrated over time using a Hidden Markov Model. From the discrete model we obtain a contiunous head orientation. We evaluate our approach on image sequences with ground truth orientation measurements. To our best knowledge we outperform state of the art results.