What’s cookin’? interpreting cooking videos using text, speech and vision. (2015)

by Jonathan Malmaud, Jonathan Huang, Vivek Rathod, Nick Johnston, Andrew Rabinovich, Kevin Murphy