Video content parsing based on combined audio and visual information (1999)

by T Zhang, C-C Kuo
Venue: in Proc. SPIE 1999