• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 1,470
Next 10 →

Learning realistic human actions from movies

by Ivan Laptev, Marcin Marszałek, Cordelia Schmid, Benjamin Rozenfeld - IN: CVPR. , 2008
"... The aim of this paper is to address recognition of natural human actions in diverse and realistic video settings. This challenging but important subject has mostly been ignored in the past due to several problems one of which is the lack of realistic and annotated video datasets. Our first contribut ..."
Abstract - Cited by 738 (48 self) - Add to MetaCart
contribution is to address this limitation and to investigate the use of movie scripts for automatic annotation of human actions in videos. We evaluate alternative methods for action retrieval from scripts and show benefits of a text-based classifier. Using the retrieved action samples for visual learning, we

Recognizing action at a distance

by Alexei A. Efros, Alexander C. Berg, Greg Mori, Jitendra Malik - PROCEEDINGS OF THE IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION , 2003
"... Our goal is to recognize human actions at a distance, at resolutions where a whole person may be, say, 30 pixels tall. We introduce a novel motion descriptor based on optical flow measurements in a spatio-temporal volume for each stabilized human figure, and an associated similarity measure to be us ..."
Abstract - Cited by 504 (20 self) - Add to MetaCart
-temporal motion descriptor. To classify the action being performed by a human figure in a query sequence, we retrieve nearest neighbor(s) from a database of stored, annotated video sequences. We can also use these retrieved exemplars to transfer 2D/3D skeletons onto the figures in the query sequence, as well

Dual-task interference in simple tasks: Data and theory

by Harold Pashler - Psychological Bulletin , 1994
"... People often have trouble performing 2 relatively simple tasks concurrently. The causes of this interference and its implications for the nature of attentional limitations have been controversial for 40 years, but recent experimental findings are beginning to provide some answers. Studies of the psy ..."
Abstract - Cited by 434 (12 self) - Add to MetaCart
of the psychological refractory period effect indicate a stubborn bottleneck encompassing the process of choosing actions and probably memory retrieval generally, together with certain other cognitive operations. Other limitations associated with task preparation, sensory-perceptual processes, and timing can generate

Human Action Retrieval via Spatio-temporal Cuboids

by Qingshan Luo, Guihua Zeng
"... An approach for human action retrieval in videos is pro-posed. Based on the volumetric analysis, actions in videos are represented by part-based cuboids. To make full use of the structural information, an explicit shape model (ESM) is designed for probabilistic Latent Semantic Analysis (pLSA). To en ..."
Abstract - Add to MetaCart
An approach for human action retrieval in videos is pro-posed. Based on the volumetric analysis, actions in videos are represented by part-based cuboids. To make full use of the structural information, an explicit shape model (ESM) is designed for probabilistic Latent Semantic Analysis (p

Learning semantic relationships for better action retrieval in images

by Vignesh Ramanathan, Congcong Li, Jia Deng, Wei Han, Zhen Li, Kunlong Gu, Yang Song, Samy Bengio, Chuck Rossenberg, Li Fei-fei - in Proc. IEEE Computer Society Conference on Computer Vision and Pattern Recognition , 2015
"... Human actions capture a wide variety of interactions between people and objects. As a result, the set of possi-ble actions is extremely large and it is difficult to obtain sufficient training examples for all actions. However, we could compensate for this sparsity in supervision by lever-aging the r ..."
Abstract - Cited by 2 (1 self) - Add to MetaCart
network framework which jointly ex-tracts the relationship between actions and uses them for training better action retrieval models. Our model incorpo-rates linguistic, visual and logical consistency based cues to effectively identify these relationships. We train and test our model on a largescale image

Modeling scene and object contexts for human action retrieval with few examples

by Yu-gang Jiang, Zhenguo Li, Shih-fu Chang - IEEE Trans Circuits & Sys Video Tech , 2011
"... Abstract—The use of context knowledge is critical for understanding human actions, which typically occur under particular scene settings with certain object interactions. For instance, driving car usually happens outdoors, and kissing involves two people moving toward each other. In this paper, we i ..."
Abstract - Cited by 4 (0 self) - Add to MetaCart
investigate the problem of context modeling for human action retrieval. We first identify ten simple object-level action atoms relevant to many human actions, e.g., people getting closer. With the action atoms and several background scene classes, we show that action retrieval can be improved through modeling

Relevance Feedback for Real World Human Action Retrieval

by Simon Jones , Ling Shao , Jianguo Zhang , Yan Liu - In: Intelligent Multimedia Interactivity , 2011
"... a b s t r a c t Content-based video retrieval is an increasingly popular research field, in large part due to the quickly growing catalogue of multimedia data to be found online. Even though a large portion of this data concerns humans, however, retrieval of human actions has received relatively li ..."
Abstract - Cited by 3 (1 self) - Add to MetaCart
a b s t r a c t Content-based video retrieval is an increasingly popular research field, in large part due to the quickly growing catalogue of multimedia data to be found online. Even though a large portion of this data concerns humans, however, retrieval of human actions has received relatively

Attention-Driven Action Retrieval with DTW-based 3D Descriptor Matching

by Rongrong Ji, Xiaoshuai Sun, Hongxun Yao, Pengfei Xu, Tianqiang Liu
"... From visual perception viewpoint, actions in videos can capture high-level semantics for video content understanding and retrieval. However, action-level video retrieval meets great challenges, due to the interferences from global motions or concurrent actions, and the difficulties in robust action ..."
Abstract - Cited by 2 (1 self) - Add to MetaCart
From visual perception viewpoint, actions in videos can capture high-level semantics for video content understanding and retrieval. However, action-level video retrieval meets great challenges, due to the interferences from global motions or concurrent actions, and the difficulties in robust action

Retrieving actions in movies

by Ivan Laptev, Patrick Pérez
"... We address recognition and localization of human actions in realistic scenarios. In contrast to the previous work studying human actions in controlled settings, here we train and test algorithms on real movies with substantial variation of actions in terms of subject appearance, motion, surrounding ..."
Abstract - Cited by 149 (7 self) - Add to MetaCart
We address recognition and localization of human actions in realistic scenarios. In contrast to the previous work studying human actions in controlled settings, here we train and test algorithms on real movies with substantial variation of actions in terms of subject appearance, motion, surrounding

The Bayesian image retrieval system, PicHunter: Theory, implementation, and psychophysical experiments

by Ingemar J. Cox, Matt L. Miller, Thomas P. Minka, Thomas V. Papathomas, Peter N. Yianilos - IEEE TRANSACTIONS ON IMAGE PROCESSING , 2000
"... This paper presents the theory, design principles, implementation, and performance results of PicHunter, a prototype content-based image retrieval (CBIR) system that has been developed over the past three years. In addition, this document presents the rationale, design, and results of psychophysica ..."
Abstract - Cited by 226 (2 self) - Add to MetaCart
This paper presents the theory, design principles, implementation, and performance results of PicHunter, a prototype content-based image retrieval (CBIR) system that has been developed over the past three years. In addition, this document presents the rationale, design, and results
Next 10 →
Results 1 - 10 of 1,470
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University