I am a final-year Ph.D. candidate in Computer Science at Cornell University, extremely fortunate to be advised by Prof. Jon Kleinberg. I am currently visiting Stanford Computer Science as an IvyPlus Scholar for the 2025–26 academic year. I am also affiliated with Google Research as a Student Researcher.
I obtained my M.S. in Computer Science from Stanford University, where I worked with Prof. Jure Leskovec. I received my B.S. in Computer Science and Mathematics, graduating Summa Cum Laude (top 1%), from the Hong Kong University of Science and Technology.
My recent research centers on LLMs for code generation, building world-leading foundation models and agentic harnesses that top industry-standard coding benchmarks. At Google, I created and first-authored Gemini-SQL2, currently the best LLM for SQL generation in the world. Post-trained from Gemini 3.1 Pro and served in a dedicated agentic harness, Gemini-SQL2 ranks #1 on the highly competitive, industry-standard BIRD benchmark, scoring 80.04 on the held-out test set in BIRD’s independent evaluation — a significant margin over all competitors, including submissions from OpenAI, Anthropic, and AWS.
More broadly, I study how to reason, retrieve, and generate over structure-rich text and context by explicitly exploiting the underlying structure. Representative directions include:
My work has drawn broad recognition and global attention: Gemini-SQL2’s launch announcement is the single most-liked post from Google Research on X over the past year. I also serve as a General Chair of the Learning on Graphs (LoG) Conference 2025 and 2026.








Yanbang Wang, Qitian Wu, Sami Abu-el-Haija, Mohammadreza Pourreza, Michael Galkin, Hadi Hemmati, Hailong Li, Yeounoh Chung, Fatma Ozcan, Bryan Perozzi, Vahab Mirrokni
Preprint (under review) 2026
Gemini-SQL2 is currently the best coding LLM for text-to-SQL in the world. Gemini-SQL2 is Gemini 3.1 Pro post-trained and serves in a dedicated agentic harness. It currently ranks #1 on the BIRD leaderboard which is the de facto standard for text-to-SQL tasks.
Yanbang Wang, Qitian Wu, Sami Abu-el-Haija, Mohammadreza Pourreza, Michael Galkin, Hadi Hemmati, Hailong Li, Yeounoh Chung, Fatma Ozcan, Bryan Perozzi, Vahab Mirrokni
Preprint (under review) 2026
Gemini-SQL2 is currently the best coding LLM for text-to-SQL in the world. Gemini-SQL2 is Gemini 3.1 Pro post-trained and serves in a dedicated agentic harness. It currently ranks #1 on the BIRD leaderboard which is the de facto standard for text-to-SQL tasks.

Yanbang Wang, Hejie Cui, Jon Kleinberg
Neural Information Processing Systems (NeurIPS) 2025
The first systematic study of how LLMs memorizes structural information in text. We find LLMs often underperform and are biased towards certain error patterns, and that stronger models memorizes better when the structures are narrated in a domain-consistent style.
Yanbang Wang, Hejie Cui, Jon Kleinberg
Neural Information Processing Systems (NeurIPS) 2025
The first systematic study of how LLMs memorizes structural information in text. We find LLMs often underperform and are biased towards certain error patterns, and that stronger models memorizes better when the structures are narrated in a domain-consistent style.

Yanbang Wang, Jon Kleinberg, Yanhong Wu
International Conference on Machine Learning (ICML) 2026
We revisit negative sampling for recommender systems from first principles and propose a redesign that improves recommendation quality.
Yanbang Wang, Jon Kleinberg, Yanhong Wu
International Conference on Machine Learning (ICML) 2026
We revisit negative sampling for recommender systems from first principles and propose a redesign that improves recommendation quality.

Yanbang Wang, Yen-Yu Chang, Yunyu Liu, Jure Leskovec, Pan Li
International Conference on Learning Representations (ICLR) 2021
Causal Anonymous Walks (CAWs) automatically retrieve temporal network motifs to represent network dynamics and use an anonymization strategy that keeps the method inductive, achieving SOTA on transductive and inductive temporal link prediction.
Yanbang Wang, Yen-Yu Chang, Yunyu Liu, Jure Leskovec, Pan Li
International Conference on Learning Representations (ICLR) 2021
Causal Anonymous Walks (CAWs) automatically retrieve temporal network motifs to represent network dynamics and use an anonymization strategy that keeps the method inductive, achieving SOTA on transductive and inductive temporal link prediction.

Yanbang Wang, Jon Kleinberg
Neural Information Processing Systems (NeurIPS) 2023
One of the first rigorous analyses of how link recommendations that boost engagement can also escalate conflict and polarization, using the Friedkin–Johnsen model of opinion dynamics.
Yanbang Wang, Jon Kleinberg
Neural Information Processing Systems (NeurIPS) 2023
One of the first rigorous analyses of how link recommendations that boost engagement can also escalate conflict and polarization, using the Friedkin–Johnsen model of opinion dynamics.

Yanbang Wang, Jon Kleinberg
International Conference on Learning Representations (ICLR) 2024
We study the consequences of representing higher-order systems as graphs rather than hypergraphs, characterizing the information lost in hypergraph projection and proposing a learning-based method to reconstruct the original higher-order relations.
Yanbang Wang, Jon Kleinberg
International Conference on Learning Representations (ICLR) 2024
We study the consequences of representing higher-order systems as graphs rather than hypergraphs, characterizing the information lost in hypergraph projection and proposing a learning-based method to reconstruct the original higher-order relations.

Yanbang Wang, Pan Li, Chongyang Bai, Jure Leskovec
The Web Conference (WebConf) 2021
TEDIC learns representations on dynamic social interaction networks by diffusing node attributes over a network and its complement and applying temporal convolutions, outperforming prior methods across four social-character prediction tasks.
Yanbang Wang, Pan Li, Chongyang Bai, Jure Leskovec
The Web Conference (WebConf) 2021
TEDIC learns representations on dynamic social interaction networks by diffusing node attributes over a network and its complement and applying temporal convolutions, outperforming prior methods across four social-character prediction tasks.

Pan Li, Yanbang Wang, Hongwei Wang, Jure Leskovec
Neural Information Processing Systems (NeurIPS) 2020
Distance Encoding (DE) is a general class of structure-related features that provably gives GNNs more expressive power than the 1-Weisfeiler–Lehman test, distinguishing node sets in almost all regular graphs where traditional GNNs fail.
Pan Li, Yanbang Wang, Hongwei Wang, Jure Leskovec
Neural Information Processing Systems (NeurIPS) 2020
Distance Encoding (DE) is a general class of structure-related features that provably gives GNNs more expressive power than the 1-Weisfeiler–Lehman test, distinguishing node sets in almost all regular graphs where traditional GNNs fail.