Yiwei Bai

Ph.D. Student
CS Department@Cornell University

Office: 344, Gates Hall, Cornell, Ithaca, NY 14853
Email: yb263 [at] cornell (dot) edu

Biography

Hi! I am a CS Ph.D. student at Cornell University, where I am supervised by professor Carla P. Gomes. My research interests lie in the intersection of reinforcement learning, decision making and computational sustainability. I resecived a B.Eng from ACM Honors Class, Zhiyuan College, Shanghai Jiao Tong University.

Education

Cornell University, USA
Ph.D. in Computer Science, Aug. 2018 to Present

Shanghai Jiao Tong University, China
Bachelor of Engineering, Sep. 2014 to Jun. 2018

Cornell University, USA
Research Intern, June. 2017 to Dec. 2017

Preprints

Zero Training Overhead Portfolios for Learning to Solve Combinatorial Problems
Yiwei Bai, Wenting Zhao, Carla P. Gomes
Under review.

We have observed that well-trained models for combinatorial problems acquired in the same training trajectory, with similar top validation performance, perform well on very different validation instances

ZTop leverages these diverse models to increase the test performance with (almost) zero training overhead.

Publication

Automating Crystal-Structure Phase Mapping by Combining Deep Learning with Constraint Reasoning
Di Chen, Yiwei Bai, Sebastian Ament, Wenting Zhao, Dan Guevarra, Lan Zhou, Bart Selman, R.Bruce van Dover, John M. Gregoire, Carla P. Gomes
Nature Machine Intelligence 2021, Cover story.

CLR-DRNets: Curriculum Learning with Restarts to Solve Visual Combinatorial Games
Yiwei Bai, Di Chen, Carla P. Gomes
CP 2021.

Fairness of Exposure in Stochastic Bandits
Lequn Wang, Yiwei Bai, Wen Sun, Thorsten Joachims
ICML 2021.

Deep Reasoning Networks for Unsupervised Pattern De-mixing with Constraint Reasoning
Di Chen, Yiwei Bai, Wenting Zhao, Sebastian Ament, John M. Gregoire, Carla P. Gomes
ICML 2020.

Batch Learning from Bandit Feedback through Bias Corrected Reward Imputation
Lequn Wang, Yiwei Bai, Arjun Bhalla, Thorsten Joachims
Appears in Real-world Sequential Decision Making workshop, ICML, 2019.

Scalable Relaxations of Sparse Packing Constraints: Optimal Biocontrol in Predator-Prey Networks
Johan Bjorcks, Yiwei Bai, Yexiang Xue, Xiaojian Wu, Mark Whitemore, Carla P. Gomes
In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence (AAAI), 2018.

An Empirical Study of Collective Behaviors in Many-agent Reinforcement Learning (Extended Abstract)
Yiwei Bai^*, Lantao Yu^*, Yaodong Yang^*, Jun Wang, Weinan Zhang, Ying Wen, Yong Yu
In the Proceedings of the 17th International Conference on Autonomous Agents and Multiagent Systems (AAMAS), 2018.

Research Projects

Yi, an AI platform playing the GO(game)
Yiwei Bai, Lequn Chen, and colleagues in Tianrang, (advised by Professor Guirong Xue), Jan. 2017

Yi won the Four-th prize in the first International Computer Go Competition

Yi won the Ninth place in the 10th UEC Cup

I and colleagues trained and tuned the policy network and value network

I and colleagues designed and implemented the reinforcement learning framework of the value network incorporated with policy network