
Syllabus
 08/24: Introduction [slides]
 Examples of machine learning problems that require counterfactual reasoning.
 Overview of course.
 Administrative issues and course policies.
 08/31: The Counterfactual Model for Learning Systems. [slides]
 Background: Imbens, Rubin, Causal Inference for Statistics, Social, and Biomedical Sciences, 2015. Chapters 1, 3, 12. (online via Cornell Library)
 09/07: Basics of online and offline estimation.
 The Counterfactual Model for Learning Systems (continued). [Thorsten Joachims]
 R. Kohavi, R. Longbotham, D. Sommerfield, and R. M. Henne. Controlled experiments on the web: survey and practical guide. Data Mining and Knowledge Discovery, pages 140-181, 2009. (paper) [Briana Vecchione]
 09/14: Doubly-robust estimator.
 M. Dudik, J. Langford, and L. Li. Doubly robust policy evaluation and learning. In ICML, pages 1097-1104, 2011. (paper) [Lequn Wang]
 M. Farajtabar, Yinlam Chow, M. Ghavamzadeh. More Robust Doubly Robust Off-policy Evaluation. In ICML, 2018. (paper) [Xiaojie Mao]
 09/21: Combination estimators.
 Yu-Xiang Wang, Alekh Agarwal and Miro Dudik. Optimal and Adaptive Off-Policy Evaluation in Contextual Bandits. In ICML, 2017. (paper) [Yi Su]
 Philip Thomas, Emma Brunskill. Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning. In ICML, 2016. (paper) [Yi Su]
 09/28: Recommender evaluation.
 A. Gilotte, C. Calauzenes, T. Nedelec, A. Abraham and S. Dolle. Offline A/B testing for recommender systems. In WSDM, 2018. (paper) [Shachi Deshpande]
 L. Yang, Y. Cui, Y. Xuan, C. Wang, S. Belongie, D. Estrin. Unbiased Offline Recommender Evaluation for Missing-Not-At-Random Implicit Feedback. In RecSys, 2018. (paper) [Longqi Yang]
 D. Liang, L. Charlin, D. Blei. Causal Inference for Recommendation. In UAI Workshop, 2016. (paper) [Longqi Yang]
 10/05: Extensions to offline evaluation.
 Alex Strehl, John Langford, Sham Kakade, Lihong Li. Learning from Logged Implicit Exploration Data. In NIPS, pages 2217-2225, 2010. (paper) [Chengrun Yang]
 L. Bottou, J. Peters, J. Q. Candela, D. X. Charles, M. Chickering, E. Portugaly, D. Ray, P. Y. Simard, and E. Snelson. Counterfactual reasoning and learning systems: The example of computational advertising. Journal of Machine Learning Research, 14(1):3207-3260, 2013. (paper) [Katherine van Koevering]
 10/12: Batch learning from bandit feedback (BLBF). [slides]
 A. Swaminathan, T. Joachims. Batch Learning from Logged Bandit Feedback through Counterfactual Risk Minimization. JMLR Special Issue in Memory of Alexey Chervonenkis, 16(1):1731-1755, 2015. (paper) [Thorsten Joachims]
 T. Joachims, A. Swaminathan, M. de Rijke. Deep Learning with Logged Bandit Feedback. In ICLR, 2018. (paper) [Thorsten Joachims]
 10/19: Propensity overfitting and dealing with large action spaces.
 A. Swaminathan and T. Joachims. The self-normalized estimator for counterfactual learning. In NIPS, pages 3213-3221, 2015. (paper) [Thorsten Joachims]
 N. Kallus and A. Zhou. Policy Evaluation and Optimization with Continuous Treatments. In AISTATS, 2018. (paper) [Angela Zhou]
 10/26: Error bounds and learning to rank with partial feedback. [slides]
 C. Cortes, Y. Mansour, and M. Mohri. Learning bounds for importance weighting. In NIPS, pages 442-450, 2010. (paper) [Kate Donahue]
 T. Joachims, A. Swaminathan, T. Schnabel. Unbiased Learning-to-Rank with Biased Feedback. In WSDM, 2017. (paper) [Thorsten Joachims]
 11/02: Propensity estimation for learning to rank.
 X. Wang, N. Golbandi, M. Bendersky, D. Metzler, M. Najork. Position Bias Estimation for Unbiased Learning to Rank in Personal Search. In WSDM, 2018. (paper) [Aman Agarwal]
 A. Agarwal, I. Zaitsev, Xuanhui Wang, Cheng Li, M. Najork, T. Joachims. Estimating Position Bias without Intrusive Interventions. To appear in WSDM, 2019. (paper) [Aman Agarwal]
 11/09: Embeddings and observational data.
 S. Bonner, F. Vasile. Causal Embeddings for Recommendation. arXiv, 2018. (paper) [Ashudeep Singh]
 N. Kallus. Balanced Policy Evaluation and Learning. arXiv, 2017. (paper) [Angela Zhou]
 11/16: Tree-based policy learning.
 S. Athey and G. Imbens. Recursive Partitioning for Heterogeneous Causal Effects. PNAS, 112(27):7353-7360, 2015. (paper) [Cheng Perng Phoo]
 11/23: Thanksgiving
 11/30: Wrap-up


Reference Material
We will mostly read original research papers, but the following books and tutorials provide entry points for the main topics of the class:
 Imbens, Rubin, "Causal Inference for Statistics, Social, and Biomedical Sciences", Cambridge University Press, 2015. (online via Cornell Library)
 Morgan, Winship, "Counterfactuals and Causal Inference", Cambridge University Press, 2007.
 T. Joachims, A. Swaminathan. SIGIR Tutorial on Counterfactual Evaluation and Learning for Search, Recommendation and Ad Placement, 2016. (homepage)
Other sources for general background on machine learning are:
 Kevin Murphy, "Machine Learning: A Probabilistic Perspective", MIT Press, 2012. (online via Cornell Library)
 Schoelkopf, Smola, "Learning with Kernels", MIT Press, 2001. (online)
 Bishop, "Pattern Recognition and Machine Learning", Springer, 2006.
 Tom Mitchell, "Machine Learning", McGraw Hill, 1997.
 Ethem Alpaydin, "Introduction to Machine Learning", MIT Press, 2004.
 Devroye, Gyoerfi, Lugosi, "A Probabilistic Theory of Pattern Recognition", Springer, 1997.
 Duda, Hart, Stork, "Pattern Classification", Wiley, 2000.
 Hastie, Tibshirani, Friedman, "The Elements of Statistical Learning", Springer, 2001.
 Vapnik, "Statistical Learning Theory", Wiley, 1998.
