Congzheng Song / 宋丛峥

Curriculum Vitae [pdf]


Email: cs2296[at]
Links: [Google scholar ] [Github ] [Linkedin ]

About Me

Hello! I am a Computer Science Ph.D. candidate at Cornell University (physically located at Cornell Tech) working with Prof. Vitaly Shmatikov. My current research interests are security & privacy issues in machine learning. I completed my bachelor's degree at Emory University, where I worked closely with Prof. Ymir Vigfusson and Prof. Lee Cooper on some fun real world deep learning application projects.

Industrial Experience

Research intern at Google Brain, August 2019 - December 2019
Research intern at Petuum Inc, May 2019 - August 2019


(* indicates equal contribution)

  1. Generalized Zero-shot ICD Coding [pdf]
    C.Song, S.Zhang, N.Sadoughi, P.Xie, E.P.Xing
    In International Joint Conference on Artificial Intelligence (IJCAI), 2020

  2. Robust Membership Encoding: Inference Attacks and Copyright Protection for Deep Learning [pdf]
    C.Song, R.Shokri
    In ACM ASIA Conference on Computer and Communications Security (AsiaCCS), 2020

  3. Overlearning Reveals Sensitive Attributes [pdf][code][slides]
    C.Song, V.Shmatikov
    In International Conference on Learning Representation (ICLR), 2020

  4. Auditing Data Provenance in Text-Generation Models [pdf][code][slides]
    C.Song, V.Shmatikov
    In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), 2019
    Oral Presentation

  5. Exploiting Unintended Feature Leakage in Collaborative Learning [pdf][code][talk][slides]
    L.Melis*, C.Song*, E. De Cristofaro, V.Shmatikov
    In IEEE Symposium on Security and Privacy (Oakland), 2019

  6. What Are Machine Learning Models Hiding? [pdf]
    V.Shamtikov, C.Song
    In Workshop on Hot Topics in Privacy Enhancing Technologies (HotPETs), 2018

  7. Kernel Distillation for Fast Gaussian Processes Prediction [pdf][code]
    C.Song*, Y.Sun*
    In NeurIPS Workshop on All of Bayesian Nonparametrics (BNP@NeurIPS), 2018
    Spotlight Presentation

  8. Predicting Clinical Outcomes from Large Scale Cancer Genomic Profiles with Deep Survival Models [pdf][code]
    S.Yousefi, F.Amrollahi, M.Amgad, C.Dong, J.E.Lewis, C.Song, D.A.Gutman, S.H.Halani, J.E.V.Vega, D.J.Brat, L.A.D.Cooper
    In Scientific Reports 7 (Nature), 2017

  9. Machine Learning Models that Remembers Too Much [pdf][code][talk][slides]
    C.Song, T.Risternpart, V.Shmatikov
    In ACM Conference on Computer and Communications Security (CCS), 2017

  10. Membership Inference Attacks Against Machine Learning Models [pdf][code][talk]
    R.Shokri, M.Stronati, C.Song, V.Shmatikov
    In IEEE Symposium on Security and Privacy (Oakland), 2017
    The Caspar Bowden Award for Outstanding Research in Privacy Enhancing Technologies 2018

  11. Learning Genomic Representations to Predict Clinical Outcomes in Cancer [pdf][code]
    S.Yousefi, C.Song, N.Nauata, L.Cooper
    In International Conference on Learning Representation Workshop (ICLRW), 2016


  1. Information Leakage in Emebedding Models [pdf]
    C.Song, A.Raghunathan
    In arXiv preprint, 2020

  2. Chiron: Privacy-preserving Machine Learning as a Service [pdf]
    T.Hunt, C.Song, R.Shokri, V.Shmatikov, E.Witchel
    In arXiv preprint, 2018

  3. Fooling OCR Systems with Adversarial Text Images [pdf][code by F.Tramèr et al]
    C.Song, V.Shmatikov
    In arXiv preprint, 2018