Publications - Yanbang Wang

Gemini-SQL2: Model, Harness, and System Design

Yanbang Wang, Qitian Wu, Sami Abu-el-Haija, Mohammadreza Pourreza, Michael Galkin, Hadi Hemmati, Hailong Li, Yeounoh Chung, Fatma Ozcan, Bryan Perozzi, Vahab Mirrokni

Preprint (under review) 2026

Gemini-SQL2 is currently the best coding LLM for text-to-SQL in the world. Gemini-SQL2 is Gemini 3.1 Pro post-trained and serves in a dedicated agentic harness. It currently ranks #1 on the BIRD leaderboard which is the de facto standard for text-to-SQL tasks.

[Google's Announcement] [VP's Repost] [BIRD Leaderboard]

Gemini-SQL2: Model, Harness, and System Design

Yanbang Wang, Qitian Wu, Sami Abu-el-Haija, Mohammadreza Pourreza, Michael Galkin, Hadi Hemmati, Hailong Li, Yeounoh Chung, Fatma Ozcan, Bryan Perozzi, Vahab Mirrokni

Preprint (under review) 2026

Gemini-SQL2 is currently the best coding LLM for text-to-SQL in the world. Gemini-SQL2 is Gemini 3.1 Pro post-trained and serves in a dedicated agentic harness. It currently ranks #1 on the BIRD leaderboard which is the de facto standard for text-to-SQL tasks.

[Google's Announcement] [VP's Repost] [BIRD Leaderboard]

Negative Sampling From the Ground Up

Yanbang Wang, Jon Kleinberg, Yanhong Wu

International Conference on Machine Learning (ICML) 2026

We revisit negative sampling for recommender systems from first principles and propose a redesign that improves recommendation quality.

2026

Gemini-SQL2: Model, Harness, and System Design

Gemini-SQL2: Model, Harness, and System Design

Negative Sampling From the Ground Up

Negative Sampling From the Ground Up

Graph-Language Models as Text-to-SQL Verifier

Graph-Language Models as Text-to-SQL Verifier

2025

Microstructures and Accuracy of Graph Recall by Large Language Models

Microstructures and Accuracy of Graph Recall by Large Language Models

Network Authentication Evaluation

Network Authentication Evaluation

2024

Network Recall by Large Language Models

Network Recall by Large Language Models

On the Relationship Between Relevance and Conflict in Online Social Link Recommendations

On the Relationship Between Relevance and Conflict in Online Social Link Recommendations

From Graphs to Hypergraphs: Hypergraph Projection and its Reconstruction

From Graphs to Hypergraphs: Hypergraph Projection and its Reconstruction

2023

On the Relationship Between Relevance and Conflict in Online Social Link Recommendations

On the Relationship Between Relevance and Conflict in Online Social Link Recommendations

A Graph-based Framework for Reducing False Positives in Authentication Alerts in Security Systems

A Graph-based Framework for Reducing False Positives in Authentication Alerts in Security Systems

2022

Algorithm and System Co-design for Efficient Subgraph-based Graph Representation Learning

Algorithm and System Co-design for Efficient Subgraph-based Graph Representation Learning

2021

Inductive Representation Learning in Temporal Networks via Causal Anonymous Walks

Inductive Representation Learning in Temporal Networks via Causal Anonymous Walks

TEDIC: Neural Modeling of Behavioral Patterns in Dynamic Social Interaction Networks

TEDIC: Neural Modeling of Behavioral Patterns in Dynamic Social Interaction Networks

Revisiting Graph Neural Networks and Distance Encoding in a Practical View

Revisiting Graph Neural Networks and Distance Encoding in a Practical View

2020

Distance Encoding: Design Provably More Powerful Neural Networks for Graph Representation Learning

Distance Encoding: Design Provably More Powerful Neural Networks for Graph Representation Learning

A Network-based Method for Estimating Potential for Career Advancement from Incomplete Data

A Network-based Method for Estimating Potential for Career Advancement from Incomplete Data

Generic Representation Learning for Dynamic Social Interaction

Generic Representation Learning for Dynamic Social Interaction

EmotionCues: Emotion-Oriented Visual Summarization of Classroom Videos

EmotionCues: Emotion-Oriented Visual Summarization of Classroom Videos

2019

Transfer Learning using Representation Learning in Massive Open Online Courses

Transfer Learning using Representation Learning in Massive Open Online Courses

Using Detailed Access Trajectories for Learning Behavior Analysis

Using Detailed Access Trajectories for Learning Behavior Analysis