Learning Visual Representations Using Images with Captions (2007)

by A Quattoni, M Collins, T Darrell
Venue:In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition