Course Description
An introductory course in machine learning, with a focus on data modeling and related methods and learning algorithms for data sciences. Tentative topic list:
- Dimensionality reduction, such as principal component analysis (PCA) and the singular value decomposition (SVD), canonical correlation analysis (CCA), independent component analysis (ICA), compressed sensing, random projection, and the information bottleneck. (We expect to cover some, but probably not all, of these topics.)
- Clustering, such as k-means, Gaussian mixture models, the expectation-maximization (EM) algorithm, and link-based clustering. (We do not expect to cover hierarchical or spectral clustering.)
- Probabilistic-modeling topics, such as graphical models, latent-variable models, inference (e.g., belief propagation), and parameter learning.
This course can be taken independently of CS4780/5780 (Machine Learning for Intelligent Systems), or in either order with it.
Prerequisites: probability theory (BTRY 3080, ECON 3130, MATH 4710, or strong performance in ENGRD 2700 or equivalent); linear algebra (MATH 2940 or equivalent); CS2110 or equivalent programming proficiency.
News (see also announcements on lecture handouts)
- Competition I has been posted!
- Friday, September 22nd: Assignment 3 is out. It is due on September 29th.
- Friday, September 13th: Assignment 2 is out. It is due on September 20th.
- Friday, September 2nd: Assignment 1 is out. It is due on September 9th.
- Tuesday, August 30th: A Python version of the demo for lecture 2 has been added to the zip file (see the lectures tab).
- Tuesday, August 23rd: The diagnostic assignment is out! It is due on August 30th.