CS4670/5670 - Computer Vision

Picture credit: xkcd

About this course

Humans are extremely good at perceiving the world from visual input alone. This comes so easily to us that we underestimate how difficult perception it is, and how hard it is for machines, as the webcomic above illustrates.
Computer vision is a subfield of AI focussed on getting machines to see as humans do, and has been around for almost half a century. This course will cover the basics of computer vision: the underlying mechanics of images, the core problems that the field focuses on, and the array of tools and techniques that have been developed. The emphasis will be on covering the fundamentals which underly both computer vision research and applications. A tentative list of topics is below:
  • Geometry / Physics of image formation
  • Properties of images and basic image processing
  • 3D reconstruction
  • Grouping (of image pixels into objects)
  • Machine learning in computer vision: basics, hand-designed feature vectors, convolutional networks
  • Detecting and localizing objects
A detailed but tentative list of learning outcomes can be found below. This course is intended for undergraduate students and MEng. students. Knowledge of basic probability and linear algebra will be useful. A primer on the aspects of linear algebra that will be useful is available here.

Quick info

Course Modality: Online, synchronous (for those in convenient time zones) and asynchronous (otherwise)
Canvas
Ed discussions (in lieu of Piazza)
Instructor: Bharath Hariharan
Lecture time: MWF 1:30pm - 2:20 pm
Lecture zoom
TAs:
  • Aaron Gokolsan
  • Albert Tsao
  • Ankush Rayabhari
  • Ashneel Das
  • Clara Song
  • Dongqing Wang
  • Jason Pan
  • Joshua Loomasney
  • Julie Phan
  • Kabir Walia
  • Shreeya Gad
  • Yiheng Dong
Instructor Office Hours: (TBD)
All Office Hours:(TBD)



Learning outcomes Resources Grading policy