Machine learning is revolutionizing our world: computers can recognize images, translate language, and even play games competitively with humans. However, there is a missing piece that is necessary for computers to take actions in the real world. My research studies Predictive Vision with the goal of anticipating the future events that may happen. To tackle this challenge, I present predictive vision algorithms that learn directly from large amounts of raw, unlabeled data. Capitalizing on millions of natural videos, my work develops methods for machines to learn to anticipate the visual future, forecast human actions, and recognize ambient sounds. Predictive vision provides a framework for learning from data to simulate possible events, enabling new applications across health, graphics, and robotics. 

Carl Vondrick is a doctoral candidate at the Massachusetts Institute of Technology where he researches and develops computer vision and machine learning technology. His research was awarded the Google PhD Fellowship, the NSF Graduate Fellowship, and is widely featured in popular press, such as NPR, CNN, the Associated Press, and the Late Show with Stephen Colbert.