Victoria Dean

vdean@mit.edu

Scholar Title

MIT EECS — Cisco Undergraduate Research and Innovation Scholar

Research Title

Understanding Video with Deep Convolutional Neural Networks

Cohort

2015–2016

Supervisor

Antonio Torralba

Abstract

Deep convolutional neural networks (CNNs) have drastically improved the state of the art for image recognition and have also shown success in finer grained vision tasks, such as localization and image captioning. However, video processing with CNNs is a less explored area, since deep learning proves most successful when large amounts of labeled data are available, and labeling multiple frames from a video is a time consuming task. I hope to utilize the enormous amounts of unlabeled
video to understand video and predict what will happen next. In addition, once our models are trained, I want to examine what our models have learned about videos and the world.

Quote

I first became interested in computer vision during high school, when I gave basket-tracking a shot for a basketball-playing robot. I really enjoyed the inference class I took last year (6.008), and I’m looking forward to applying deep neural nets to video.

Back to Scholars