- About
- Events
- Calendar
- Graduation Information
- Cornell Tech Colloquium
- Student Colloquium
- BOOM
- Spring 2023 Colloquium
- Conway-Walker Lecture Series
- Salton Lecture Series
- Seminars / Lectures
- Big Red Hacks
- Cornell University High School Programming Contests 2023
- Game Design Initiative
- CSMore: The Rising Sophomore Summer Program in Computer Science
- Explore CS Research
- Research Night
- Cornell Junior Theorists' Workshop
- People
- Courses
- Research
- Undergraduate
- M Eng
- MS
- PhD
- Admissions
- Current Students
- Computer Science Graduate Office Hours
- Business Card Policy
- Cornell Tech
- Curricular Practical Training
- Exam Scheduling Guidelines
- Fellowship Opportunities
- Field of Computer Science Ph.D. Student Handbook
- Graduate TA Handbook
- Field A Exam Summary Form
- Graduate School Forms
- Instructor / TA Application
- Ph.D. Requirements
- Ph.D. Student Financial Support
- Special Committee Selection
- Travel Funding Opportunities
- The Outside Minor Requirement
- Diversity and Inclusion
- Graduation Information
- CS Graduate Minor
- Outreach Opportunities
- Parental Accommodation Policy
- Special Masters
- Student Spotlights
- Contact PhD Office
The Cornell Database Group is interested in all aspects of data analysis and database management. This includes projects at the intersection between database systems and other areas such as machine learning or natural language processing. For recent news, visit the Cornell Database Group Homepage or follow us on Twitter.
Recent Publications
2023
- VLDB 2023 SkinnerMT: parallelizing for efficiency and robustness in adaptive query processing on multicore platforms. Ziyun Wei, Immanuel Trummer.
- SIGMOD 2023 Demonstration of ThalamusDB: answering complex SQL queries with natural language predicates on multi-modal data. Saehan Jo, Immanuel Trummer.
- SIGMOD 2023 Demonstrating NaturalMiner: searching large data sets for abstract patterns described in natural language. Immanuel Trummer.
2022
- VLDB 2022, PhD Workshop Building learned federated query optimizers. Victor Giannakouris, Immanuel Trummer.
- VLDB 2022 CodexDB: generating code for processing SQL queries using GPT-3 Codex. Immanuel Trummer.
- VLDB 2022 Black-box optimization of comparative data summaries via reinforcement learning. Immanuel Trummer.
- VLDB 2022 From BERT to GPT-3 Codex: harnessing the potential of very large language models for data management. Immanuel Trummer.
- VLDB 2022 UDO: universal database optimization using reinforcement learning. Junxiong Wang, Immanuel Trummer, Debabrota Basu.
- SIGMOD 2022 Demonstrating DB-BERT: a database tuning tool that “reads the manual”. Immanuel Trummer.
- AAAI 2022 Procrastinated tree search: black-box optimization with delayed, noisy, and multi-fidelity feedback. Junxiong Wang, Debabrota Basu, Immanuel Trummer.
- SIGMOD 2022 DB-BERT: a database tuning tool that “reads the manual”. Immanuel Trummer.
- CIDR 2022 Towards NLP-enhanced data profiling tools. (Abstract) Immanuel Trummer.
2021
- TODS 2021 “Best of SIGMOD” Edition SkinnerDB: regret-bounded query evaluation via reinforcement learning. Immanuel Trummer, Junxiong Wang, Ziyun Wei et al.
- VLDB 2021 The case for NLP-enhanced database tuning: towards tuning tools that read the manual. Immanuel Trummer.
- VLDB 2021 Robust voice querying with MUVE: optimally visualizing results of phonetically similar queries. Ziyun Wei, Immanuel Trummer, Connor Anderson.
- IEEE Data Engineering Bulletin WebChecker: towards an infrastructure for efficient misinformation detection at Web scale. Immanuel Trummer.
- SIGMOD Record 2021 Database tuning using natural language processing. Immanuel Trummer.
- SIGMOD 2021 Demonstrating UDO: a unified approach for optimizing transaction code, physical design, and system parameters via reinforcement learning. Junxiong Wang, Immanuel Trummer, Debabrota Basu.
- SIGMOD 2021 Demonstrating robust voice querying with MUVE: optimally visualizing results of phonetically similar queries. Ziyun Wei, Immanuel Trummer, Connor Anderson.
- ICDE 2021 Optimally summarizing data by small fact sets for concise answers to voice queries. Immanuel Trummer, Connor Anderson.