CS6670 - Computer Vision

Picture credit: xkcd

About this course

Humans are extremely good at perceiving the world from visual input alone. This comes so easily to us that we underestimate how difficult perception it is, and how hard it is for machines, as the webcomic above illustrates.
Computer vision is a subfield of AI focussed on getting machines to see as humans do. We have been at it for almost half a century now, and while the problem is still far from getting solved, we have made tremendous progress. The past decade has especially been a revolution in the making. This course will cover these ideas, both the classic work based on geometry and physics, as well as the new ones based on convolutional networks and deep learning.

Quick info

Instructor: Bharath Hariharan
Lecture time: Tues. and Thurs. 1:25pm - 2:40pm
Lecture venue: Gates G01
TA: TBD

Office Hours:
Bharath: Tue and Thur 2:45-3:45 pm (at 311 Gates Hall)

Intended audience and prerequisites:

This course is intended for PhD students. As such, it will assume familiarity and comfort with mathematics in general and linear algebra and probability and statistics in particular. This intended audience also means that there will be a strong focus on teaching research abilities (see below for what this means).
If you are a PhD student and you find the course full, please send me a note and I will help you enroll. Undergraduate students are strongly encouraged to first take the undergraduate course (CS 4670/5670).



Learning outcomes Calendar and lecture notes Resources Grading policy List of projects