| Week | Date | Notes, Readings, and HW | 
    
    
    
      
      | 1 | Tue, Feb 09 | Introduction | 
    
    
      
      |  | Thu, Feb 11 | Optimization and linear algebra refresher | 
    
    
      
      | 2 | Tue, Feb 16 | Regularized linear least squares | 
    
    
      
      |  | Thu, Feb 18 | Sparse least squares and iterations | 
    
    
      
      | 3 | Tue, Feb 23 | Stochastic gradients, scaling, and Newton | 
    
    
      
      |  | Thu, Feb 25 | Randomized numerical linear algebra 
          Finding Structure with Randomness: Probabilistic Algorithms for Constructing Approximate Matrix Decompositions, Halko, Martinsson, and Tropp, SIREV, 2011.LSRN: A Parallel Iterative Solver for Strongly Over- or Under-Determined Systems, Meng, Saunders, Mahoney, SISC 2014Sec 5, Lectures on Randomized NLA, Drineas and Mahoney | 
    
    
      
      | 4 | Tue, Mar 02 | Latent factor models | 
    
    
      
      |  | Thu, Mar 04 | SVD and other low-rank decompositions 
          On the relationships between SVD, KLT, and PCA, Gerbrands, Pattern Recognition, 1981Trace optimization and eigenproblems in dimension reduction methods, Kikiopoulou, Chen, and Saad, NLAA 2010On the compression of low rank matrices, Cheng, Gimbutas, Martinsson, and Rokhlin, SISC 2005CUR matrix decompositions for improved data analysis, Mahoney and Drineas, PNAS 2009Meeting notes | 
    
    
      
      | 5 | Tue, Mar 09 | Wellness day | 
    
    
      
      |  | Thu, Mar 11 | Non-negative matrix factorization 
          Nonnegative Matrix Factorization (Gillis), Chapter 1The Whys and Hows of NMF, GillisLearning the parts of objects by non-negative matrix factorization, Lee and Seung, Nature, 1999Computing a nonnegative matrix factorization – provably, Arora, Ge, Kannan, and Moitra, SICOMP, 2016When Does NMF Give a Correct Decomposition into Parts?, Donoho and Stodden, NeurIPS, 2003Algorithms for NMF and NTFs: a unified view based on block coordinate descent framework, Kim, He, and Park, J. Glob. Optim, 20113Meeting notes | 
    
    
      
      | 6 | Tue, Mar 16 | Tensor basics, HOSVD, Tucker, and ALS 
          Tensor Decompositions and Applications, Kolda and Bader, SIREV, 2009Tensor Computations and Applications in Data Mining, Elden, slides from SIAM AM 2008From Matrix to Tensor, Van Loan, slides from Cornell CS colloquiumTensors for Data Mining and Data Fusion, Papalexakis, Faloutsos, and Sidriropoulos, ACM TIS, 2016Meeting notes | 
    
    
      
      |  | Thu, Mar 18 | CP decomposition and algorithms, CUR and tensor trains 
          Tensor Decompositions and Applications, Kolda and Bader, SIREV, 2009Tensor Decompositions: A Mathematical Tool for Data Analysis, Kolda, slides from JMM 2018Epsilon-ALS for Orthogonal Low-Rank Tensor Approximation, Yang, SIMAX 2020Low Multilinear Rank Approximations of Tensors, Che, Wei, and Yan, SIMAX 2020Low-Rank Approximation in the Frobenius Norm by Column and Row Subset Selection, Cortinovis and Kressner, SIMAX 2020Stochastic Gradients for Large-Scale Tensor Decomposition, Kolda and Hong, SIMODS 2020Exercise notebook | 
    
    
      
      | 7 | Tue, Mar 23 | Nonlinear dimensionality reduction 
          A global geometric framework for nonlinear dimensionality reduction, Tenenbaum, de Silva, and Langford, Science 2000Nonlinear dimensionality reduction by locally linear embedding, Roweis and Saul, Science 2000Visualizing Data using t-SNE, van der Maaten and Hinton, JMLR 2008Dimensionality Reduction: A Comparitive Review, van der Maaten, Postma, and van den Herik, Tech report 2009Dimension Reduction: A Guided Tour, Burges, FTML 2009Global versus local methods in nonlinear dimensionality reduction, de Silva and Tenenbaum, NeurIPS 2003Large-scale SVD and manifold learning, Talwalkar, Kumar, Mohri, and Rowley, JMLR 2013Accelerating t-SNE using tree-based algorithms, van der Maaten, JMLR 2014 | 
    
    
      
      |  | Thu, Mar 25 | Function approximation fundamentals 
          Nonlinear Approximation, DeVore, Acta Numerica 1998 - long, but please do read sections 1 and 9 at leastApproximation Theory and Approximation Practice, Trefethen, SIAM 2019 - a beautiful text, focused on polynomial and rational approximation in 1D; useful to skim, don’t consider it assigned readingA Course in Approximation Theory, Cheney and Light, AMS 2009 - again, not considered assigned reading (unless you want to do DNN approximation, in which case please read ch 23-25)Class notebook | 
    
    
      
      | 8 | Tue, Mar 30 | Low-dim structure in function approximation 
          Active Subspaces: Emerging Ideas for Dimension Reduction in Parameter Studies, Constantine, SIAM 2015Active Subspace Methods in Theory and Practice: Applications to Kriging Surfaces, Constantine, Dow, and Wang, SISC 2014Active Manifolds: a non-linear analogue to Active Subspaces, Bridges, Gruber, Felder, Verma, Hoff, ICML 2019Constrained global optimization of functions with low effective dimensionality using multiple random embeddings, Cartis, Massart, Otemissov, arXiv 2020 | 
    
    
      
      |  | Thu, Apr 01 | Low-dim structure in function approximation 
          Approximation of high-dimensional parametric PDEs, Cohen, DeVore, Acta Numerica 2015Model reduction via proper orthogonal decomposition, Pinnau, in Model Order Reduction: Theory, Research Aspects and Applications, Springer 2008Nonlinear model reduction via discrete empirical interpolation, Chaturantabut, Sorensen, SISC 2010Class notebook | 
    
    
      
      | 9 | Tue, Apr 06 | Many interpretations of kernels 
          ESL, sec 14.5.4Kernel techniques: From machine learning to meshless methods, Schaback and Wendland, Acta Numerica 2006Gaussian Processes for Machine Learning, Rasumussen and Williams, 2006 - read Ch 1Kernel Methods in ML, Hoffman, Scholkopf, Smola, Annals of Statistics, 2008Spline Models for Observational Data, Wahba, SIAM 1990 - read the foreword in particular | 
    
    
      
      |  | Thu, Apr 08 | Approaches to kernel selection | 
    
    
      
      | 10 | Tue, Apr 13 | Computing with kernels | 
    
    
      
      |  | Thu, Apr 15 | Scalable kernel methods 
          Kernel Interpolation for Scalable Structured GPs, Wilson and Nickisch, ICML 2015Scalable Log Determinants for GP Kernel Learning, Eriksson et al, NeurIPS 2017Scaling GP Regression with Derivatives, Dong et al, NeurIPS 2018Exact GPs on a Million Data Points, Wang et al, NeurIPS 2019Fast estimation of tr(f(A)) via stochastic Lanczos quadrature, Ubaru, Chen, and Saad, SIMAX 2017Meeting notes | 
    
    
      
      | 11 | Tue, Apr 20 | Matrices associated with graphs | 
    
    
      
      |  | Thu, Apr 22 | Function approximation on graphs 
          Semi-Supervised Learning Using Gaussian Fields and Harmonic Functions, Zhu, Gharhraman, and Lafferty, ICML 2003Learning with Local and Global Consistency, Zhou, NeurIPS 2004Empirical stationary correlations for semi-supervised learning on graphs, Xu, Dyer, and Owen, Ann Appl Stat, 2010Using Local Spectral Methods to Robustify Graph-Based Learning Algorithms, Gleich and Mahoney, KDD 2015 | 
    
    
      
      | 12 | Tue, Apr 27 | Graph clustering and partitioning 
          A tutorial on spectral clustering, von Luxburg, Statistics and Computing 2007Communities in networks, Porter, Onnela, and Mucha, Notices of the AMS, 2009Community detection in networks: A user guide, Fortunato and Hric, Physics Reports, 2016Trace optimization and eigenproblems in dimension reduction methods, Kokiopoulou, Chen, and Saad, NLAA, 2011 | 
    
    
      
      |  | Thu, Apr 29 | Centrality measures | 
    
    
      
      | 13 | Tue, May 04 | Learning linear system dynamics | 
    
    
      
      |  | Thu, May 06 | Learned dynamics and extrapolation | 
    
    
      
      | 14 | Tue, May 11 | Koopman theory and lifting | 
    
    
      
      |  | Thu, May 13 | Learning nonlinear dynamics 
          Discovering governing equations from data by sparse identification of nonlinear dynamical systems, Brunton, Proctor, Kutz, PNAS 2016A Data-Driven Approximation of the Koopman Operator: Extending Dynamic Mode Decomposition, Williams, Kevrekidis, Rowley, J Nonlinear Science 2015A Kernel-Based Method for Data-Driven Koopman Spectral Analysis, Williams, Rowley, Kevrekidis, J Comp Dynamics 2015Class notebook |