CS4670/5670 - Computer Vision

Picture credit: xkcd

About this course

Humans are extremely good at perceiving the world from visual input alone. This comes so easily to us that we underestimate how difficult perception it is, and how hard it is for machines, as the webcomic above illustrates.
Computer vision is a subfield of AI focussed on getting machines to see as humans do, and has been around for almost half a century. This course will cover the basics of computer vision: the underlying mechanics of images, the core problems that the field focuses on, and the array of tools and techniques that have been developed. The emphasis will be on covering the fundamentals which underly both computer vision research and applications. A tentative list of topics is below:
  • Geometry / Physics of image formation
  • Properties of images and basic image processing
  • 3D reconstruction
  • Grouping (of image pixels into objects)
  • Machine learning in computer vision: basics, hand-designed feature vectors, convolutional networks
  • Detecting and localizing objects
A detailed but tentative list of learning outcomes can be found below. This course is intended for undergraduate students and MEng. students. Knowledge of basic probability and linear algebra will be useful.

Quick info

Instructor: Bharath Hariharan
Lecture time: MWF 1:25pm - 2:15pm
Lecture venue: Hollister B14 Ives 305
TAs:
  • Rohit Bandaru
  • Dhruv Baijal
  • Paul deVito
  • Eashan Garg
  • Shantanu Gore
  • Eashaan Kumar
  • Yuntian Lan
  • Laura Lin
  • Suzie Petryk
  • Nehal Rawat
  • Zikai Wen
Office Hours:



Learning outcomes Calendar and lecture notes Resources Grading policy