Multilevel integration of vision and speech understanding using bayesian networks (1999)

by S Wachsmuth, H Brandt-Pook, G Socher, F Kummert, G Sagerer
Venue:Computer Vision Systems: First Int. Conf