Menu:

Cornell Database Group Publications

2013

Benjamin Sowell, Marcos Antonio Vaz Salles, Tuan Cao, Alan J. Demers, Johannes Gehrke: An Experimental Analysis of Iterated Spatial Joins in Main Memory. PVLDB 6(14): 1882-1893 (2013)

Wenlei Xie, Guozhang Wang, David Bindel, Alan J. Demers, Johannes Gehrke: Fast Iterative Graph Computation with Block Updates. PVLDB 6(14): 2014-2025 (2013)

Raphael M. Reischuk, Florian Schroeder, Johannes Gehrke: Secure and customizable web development in the safe activation framework. ACM Conference on Computer and Communications Security 2013

Johannes Gehrke: Big Data Pipelines. CIDR 2013

Karthik Raman, Adith Swaminathan, Johannes Gehrke, Thorsten Joachims: Beyond myopic inference in big data pipelines. KDD 2013

Yin Lou, Rich Caruana, Johannes Gehrke, Giles Hooker: Accurate intelligible models with pairwise interactions. KDD 2013

Gabriel Bender, Lucja Kot, Johannes Gehrke, Christoph Koch. Fine-Grained Disclosure Control for App Ecosystems. SIGMOD 2013.

Sudip Roy, Lucja Kot, Christoph Koch. Quantum Databases. CIDR 2013.

Guozhang Wang, Wenlei Xie, Alan Demers, and Johannes Gehrke. Asynchronous Large-Scale Graph Processing Made Easy. CIDR 2013.

2012

Konstantinos Mamouras, Sigal Oren, Lior Seeman, Lucja Kot, Johannes Gehrke. The Complexity of Social Coordination. VLDB 2012.

B. Sowell, W. Golab, and M.A. Shah. Minuet: A Scalable Distributed Multiversion B-Tree. PVLDB 5(9):884-895, 2012.

Yin Lou, Rich Caruana and Johannes Gehrke. Intelligible Models for Classification and Regression. KDD 2012

Johannes Gehrke, Michael Hay, Edward Lui, Rafael Pass. Crowd-Blending Privacy. CRYPTO 2012

Michaela Goetz, Suman Nath, Johannes Gehrke. MaskIt: privately releasing user context streams for personalized mobile applications. SIGMOD Conference 2012: 289-300

Raphael M. Reischuk, Michael Backes, Johannes Gehrke. SAFE extensibility of data-driven web applications. WWW 2012: 799-808

Michaela Goetz, Ashwin Machanavajjhala, Guozhang Wang, Xiaokui Xiao, Johannes Gehrke. Publishing Search Logs - A Comparative Study of Privacy Guarantees. IEEE Trans. Knowl. Data Eng. 24(3): 520-532 (2012)

2011

Truls A. Bjørklund , Michaela Goetz, Johannes Gehrke, and Nils Grimsmo. Workload-Aware Indexing for Keyword Search in Social Networks. CIKM 2011

Tao Zou, Guozhang Wang, Marcos Vaz Salles, David Bindel, Alan Demers, Johannes Gehrke, and Walker White. Making Time-stepped Applications Tick in the Cloud. SOCC 2011

Nitin Gupta, Milos Nikolic, Sudip Roy, Gabriel Bender, Lucja Kot, Johannes Gehrke, Christoph Koch. Entangled Transactions. VLDB 2011.

Xiaokui Xiao, Gabriel Bender, Michael Hay, Johannes Gehrke. iReduct: Differential Privacy with Reduced Relative Errors. SIGMOD 2011.

Jiaqi Zhai, Yin Lou, Johannes Gehrke. ATLAS: A Probabilistic Algorithm for High Dimensional Similarity Search. SIGMOD 2011.

Tuan Cao, Marcos Vaz Salles, Benjamin Sowell, Yao Yue, Johannes Gehrke, Alan Demers, Walker White. Fast Checkpoint Recovery Algorithms for Frequently Consistent Applications. SIGMOD 2011.

Nitin Gupta, Lucja Kot, Sudip Roy, Gabriel Bender, Johannes Gehrke, Christoph Koch. Entangled queries: enabling declarative data-driven coordination. SIGMOD 2011. (Best Paper Award Winner)

Oliver Kennedy, Suman Nath. Jigsaw: Efficient optimization over uncertain enterprise data. SIGMOD 2011.

Tuan Cao, Benjamin Sowell, Marcos Vaz Salles, Alan Demers, Johannes Gehrke. BRRL: A Recovery Library for Main-Memory Applications in the Cloud (Demonstration Paper). SIGMOD 2011.

Nitin Gupta, Lucja Kot, Gabriel Bender, Sudip Roy, Johannes Gehrke, Christoph Koch. Declarative Data-Driven Coordination in the Youtopia System (Demonstration Paper). SIGMOD 2011.

Oliver Kennedy, Suman Nath, Steve Lee, Slawek Smyl, Charles Loboz. Fuzzy Prophet: Parameter exploration in uncertain enterprise scenarios (Demonstration Paper). SIGMOD 2011.

Oliver Kennedy, Yanif Ahmad, Christoph Koch. DBToaster: Agile Views in a Dynamic Data Management System. CIDR 2011.

2010

J. Y. Halpern, From causal models to counterfactual structures, Proceedings of the Twelfth International Conference on Principles of Knowledge Representation and Reasoning (KR 2010), 2010.

R. Dechter, H. Geffner, J.Y. Halpern, Heuristics, Probability, and Causality: A Tribute to Judea Pearl, College Publications, 2010.

J.Y. Halpern and C. Hitchcock, Actual causation and the art of modeling, in Heuristics, Probability and Causality: A Tribute to Judea Pearl, (editors, R. Dechter, H. Geffner, and J. Y. Halpern), College Publications, 2010, pp. 383-406

A. Meliou, W. Gatterbauer, J.Y. Halpern, C. Koch, K. F. Moore, and D. Suciu, Causality in databases, IEEE Data Engineering Bulletin 33:3, 2010, pp. 59-67.

Lucja Kot, Nitin Gupta, Sudip Roy, Johannes Gehrke, and Christoph Koch. Beyond Isolation: Research Opportunities in Declarative Data-Driven Coordination. SIGMOD Record, 2010.

Guozhang Wang, Marcos Vaz Salles, Benjamin Sowell, Xun Wang, Tuan Cao, Alan Demers, Johannes Gehrke, Walker White. Behavioral Simulations in MapReduce. VLDB, 2010.

Arvind Arasu, Michaela Goetz, Raghav Kaushik. On Active Learning of Record Matching Packages. SIGMOD, 2010.

Christoph Koch. Incremental Query Evaluation in a Ring of Databases. PODS, 2010

Daniel Deutch, Christoph Koch, and Tova Milo. On Probabilistic Fixpoint and Markov Chain Query Languages. PODS, 2010

Truls A. Bjørklund, Michaela Goetz, Johannes Gehrke.Search in Social Networks with Access Control. KEYS Workshop on Keyword Search on Structured Data, held in association with SIGMOD 2010.

Xiaokui Xiao, Guozhang Wang, Johannes Gehrke. Differential privacy via wavelet transforms. ICDE 2010: 225-236.

Johannes Gehrke, Daniel Kifer, Ashwin Machanavajjhala.Privacy in data publishing. ICDE 2010: 1213.

Dan Olteanu, Jiewen Huang, Christoph Koch: Approximate confidence computation in probabilistic databases. ICDE 2010: 145-156.

Oliver Kennedy, Christoph Koch: PIP: A database system for great and small expectations. ICDE 2010: 157-168.

Marcos Antonio Vaz Salles, Jens Dittrich, Lukas Blunschi: Intensional associations in dataspaces. ICDE 2010: 984-987.

Guozhang Wang, Marcos Antonio Vaz Salles, Benjamin Sowell, Xun Wang, Tuan Cao, Alan J. Demers, Johannes Gehrke, Walker M. White: Behavioral Simulations in MapReduce. CoRR abs/1005.3773: (2010).

2009

Ashwin Machanavajjhala, Johannes Gehrke, Michaela Goetz. Data Publishing against Realistic Adversaries. PVLDB 2(1): 790-801 (2009).

Marcos Vaz Salles, Tuan Cao, Benjamin Sowell, Alan Demers, Johannes Gehrke, Christoph Koch, and Walker White. An Evaluation of Checkpoint Recovery for Massively Multiplayer Online Games. PVLDB 2(1): 1258-1269 (2009).

Lucja Kot, Christoph Koch: Cooperative Update Exchange in the Youtopia System. PVLDB 2(1): 193-204 (2009).

Yanif Ahmad, Christoph Koch: DBToaster: A SQL Compiler for High-Performance Delta Processing in Main-Memory Databases. PVLDB 2(2): 1566-1569 (2009)

Truls A Bjørklund, Johannes Gehrke and Øystein Torbjørnsen.A Confluence of Column Stores and Search Engines: Opportunities and Challenges. Proceedings of the USETIM 2009 Workshop (Using Search Engine Technology for Information Management), held in association with VLDB 2009.

Jayant Madhavan, Loredana Afanasiev, Lyublena Antova, Alon Y. Halevy: Harnessing the Deep Web: Present and Future. CIDR 2009

Xiaokui Xiao, Guozhang Wang, Johannes Gehrke: Interactive anonymization of sensitive data. SIGMOD 2009: 1051-1054

Alan J. Demers, Johannes Gehrke, Christoph Koch, Ben Sowell, Walker M. White: Database research in computer games. SIGMOD 2009: 1011-1014

Jiewen Huang, Lyublena Antova, Christoph Koch, Dan Olteanu: MayBMS: a probabilistic database management system. SIGMOD 2009: 1071-1074

Christoph Koch. MayBMS: A System for Managing Large Uncertain and Probabilistic Databases. Chapter 6 of Charu Aggarwal, ed., Managing and Mining Uncertain Data, Springer-Verlag, 2009.

Rakesh Agrawal, Anastasia Ailamaki, Philip A. Bernstein, Eric A. Brewer, Michael J. Carey, Surajit Chaudhuri, AnHai Doan, Daniela Florescu, Michael J. Franklin, Hector Garcia-Molina, Johannes Gehrke, Le Gruenwald, Laura M. Haas, Alon Y. Halevy, Joseph M. Hellerstein, Yannis E. Ioannidis, Henry F. Korth, Donald Kossmann, Samuel Madden, Roger Magoulas, Beng Chin Ooi, Tim O'Reilly, Raghu Ramakrishnan, Sunita Sarawagi, Michael Stonebraker, Alexander S. Szalay, Gerhard Weikum: The Claremont report on database research. Also in Communications of the ACM 52(6): 56-65 (2009)

Walker White, Christoph Koch, Johannes Gehrke, and Alan Demers.Better Scripts, Better Games. Communications of the ACM 52(3): 42-47, March 2009.

Jens Dittrich, Lukas Blunschi, Marcos Antonio Vaz Salles. Indexing Moving Objects using Short-Lived Throwaway Indexes.. SSTD 2009.

Jens Dittrich, Marcos Antonio Vaz Salles, Lukas Blunschi. iMeMex: From Search to Information Integration and Back. IEEE Data Engineering Bulleting 2009, Vol. 32 No. 2 (invited paper)

Christoph Koch: A compositional query algebra for second-order logic and uncertain databases. ICDT 2009: 127-140

Michaela Goetz, Christoph Koch. A compositional framework for complex queries over uncertain data. ICDT 2009: 149-161.

Michael Benedikt, Christoph Koch: From XQuery to relational logics. ACM Trans. Database Syst. 34(4): (2009).

Lyublena Antova, Christoph Koch, Dan Olteanu: 10^(10^6) Worlds and Beyond: Efficient Representation and Processing of Incomplete Information. VLDB J. 18(5): 1021-1040 (2009).

Truls A. Bjørklund, Nils Grimsmo, Johannes Gehrke, Øystein Torbjørnsen: Inverted indexes vs. bitmap indexes in decision support systems. CIKM 2009: 1509-1512.

Lars Brenna, Johannes Gehrke, Mingsheng Hong, Dag Johansen: Distributed event stream processing with non-deterministic finite automata. DEBS 2009.

Nitin Gupta, Alan J. Demers, Johannes Gehrke, Philipp Unterbrunner, Walker M. White: Scalability for Virtual Worlds. ICDE 2009: 1311-1314.

Oliver Kennedy, Christoph Koch, Alan J. Demers: Dynamic Approaches to In-network Aggregation. ICDE 2009: 1331-1334.

Dan Olteanu, Jiewen Huang, Christoph Koch: SPROUT: Lazy vs. Eager Query Plans for Tuple-Independent Probabilistic Databases. ICDE 2009: 640-651.

Mingsheng Hong, Mirek Riedewald, Christoph Koch, Johannes Gehrke, Alan J. Demers: Rule-based multi-query optimization. EDBT 2009: 120-131.

Ben Sowell, Alan J. Demers, Johannes Gehrke, Nitin Gupta, Haoyuan Li, and Walker M. White: From Declarative Languages to Declarative Processing in Computer Games. CIDR 2009.

Alin Dobra, Minos N. Garofalakis, Johannes Gehrke, Rajeev Rastogi: Multi-query optimization for sketch-based estimation. Information Systems. 34(2): 209-230 (2009).

Michaela Goetz, Christoph Koch, Wim Martens. Efficient Algorithms for Descendant-Only Tree Pattern Queries. Information Systems. 34(7): 602-623 (2009).

Minos N. Garofalakis, Johannes Gehrke, and Divesh Srivastava: Special issue: best papers of VLDB 2007. VLDB Journal 18(2): 383-384 (2009).

Christoph Koch: Applications of Automata in XML Processing. CIAA 2009: 2.

Johannes Gehrke: Technical perspective - Data stream processing: when you only get one look. Commun. ACM 52(10): 96 (2009)

Mingsheng Hong, Alan J. Demers, Johannes Gehrke, Mirek Riedewald: Event and Pattern Detection over Streams. Encyclopedia of Database Systems 2009: 1029-1033

Ashwin Machanavajjhala, Johannes Gehrke: Randomization Methods to Ensure Data Privacy. Encyclopedia of Database Systems 2009: 2319-2324

Johannes Gehrke: Scalable Decision Tree Construction. Encyclopedia of Database Systems 2009: 2469-2474

Yong Yao, Johannes Gehrke: Continuous Queries in Sensor Networks. Encyclopedia of Database Systems 2009: 488-492

Biswanath Panda, Johannes Gehrke, Mirek Riedewald: Database Techniques to Improve Scientific Simulations. Encyclopedia of Database Systems 2009: 733-738

Johannes Gehrke: DBMS Component. Encyclopedia of Database Systems 2009: 755

Johannes Gehrke: DBMS Interface. Encyclopedia of Database Systems 2009: 755-756

Christoph Koch: Logical Foundations of Web Data Extraction. Encyclopedia of Database Systems 2009: 1649-1652

Christoph Koch: Parameterized Complexity of Queries. Encyclopedia of Database Systems 2009: 2041-2044

Christoph Koch: XML Stream Processing. Encyclopedia of Database Systems 2009: 3634-3637

Yanif Ahmad, Ugur Çetintemel: Data Stream Management Architectures and Prototypes. Encyclopedia of Database Systems 2009: 639-643

Michaela Goetz, Ashwin Machanavajjhala, Guozhang Wang, Xiaokui Xiao, Johannes Gehrke: Privacy in Search Logs. CoRR abs/0904.0682: (2009).

Benjamin Sowell, Alan J. Demers, Johannes Gehrke, Nitin Gupta, Haoyuan Li, Walker M. White: From Declarative Languages to Declarative Processing in Computer Games. CoRR abs/0909.1770: (2009).

Xiaokui Xiao, Guozhang Wang, Johannes Gehrke: Differential Privacy via Wavelet Transforms. CoRR abs/0909.5530: (2009).

Lucja Kot, Christoph Koch: Cooperative Update Exchange in the Youtopia System. CoRR abs/0903.5346: (2009).

2008

Walker White, Christoph Koch, Johannes Gehrke, and Alan Demers. Better Scripts, Better Games. ACM Queue, Vol. 6, No. 7, November/December 2008

W. White, B. Sowell, J. Gehrke, and A. Demers: Declarative Processing for Computer Games. In Proc. of the 2008 ACM SIGGRAPH Sandbox Symposium (Sandbox 2008).>

Namit Jain, Shailendra Mishra, Anand Srinivasan, Johannes Gehrke, Jennifer Widom, Hari Balakrishnan, Ugur Çetintemel , Mitch Cherniack , Richard Tibbetts , Stanley B. Zdonik: Towards a streaming SQL standard. PVLDB 1(2): 1379-1390 (2008)

Felix Weigel, Biswanath Panda, Mirek Riedewald, Johannes Gehrke, Manuel Calimlim: Large-scale collaborative analysis and extraction of web data. PVLDB 1(2): 1476-1479 (2008)

Jayant Madhavan, David Ko, Lucja Kot, Vignesh Ganapathy, Alex Rasmussen, Alon Y. Halevy: Google's Deep Web crawl. PVLDB 1(2): 1241-1252 (2008)

Nitin Gupta, Alan J. Demers, and Johannes Gehrke: SEMMO: a scalable engine for massively multiplayer online games. SIGMOD Conference 2008: 1235-1238.

Christoph Koch. Approximating Predicates and Expressive Queries on Probabilistic Databases Proc. PODS 2008.

Robert Albright, Alan J. Demers, Johannes Gehrke, Nitin Gupta, Hooyeon Lee, Rick Keilty, Gregory Sadowski, Ben Sowell, and Walker M. White: SGL: a scalable language for data-driven games. SIGMOD Conference 2008: 1217-1222.

David Martin, Johannes Gehrke, and Joseph Halpern. Toward Expressive and Scalable Sponsored Search Auctions. ICDE Conference 2008. Cancun, Mexico, April 2008.

Ashwin Machanavajjhala, Daniel Kifer, John Abowd, Johannes Gehrke, and Lars Vilhuber. Privacy: From Theory to Practice on the Map. ICDE Conference 2008. Cancun, Mexico, April 2008.

Christoph Koch, Stefanie Scherzinger, Michael Schmidt: XML Prefiltering as a String Matching Problem. ICDE 2008: 626-635.

Lyublena Antova, Thomas Jansen, Christoph Koch, Dan Olteanu: Fast and Simple Relational Processing of Uncertain Data. ICDE 2008: 983-992.

Samuel Madden, Johannes Gehrke: Declarative, Domain-Specific Languages - Elegant Simplicity or a Hammer in Search of a Nail? ICDE 2008: 7

David J. Martin, Johannes Gehrke, Joseph Y. Halpern: Toward Expressive and Scalable Sponsored Search Auctions. CoRR abs/0809.0116: (2008).

Christoph Koch: A Compositional Query Algebra for Second-Order Logic and Uncertain Databases. CoRR abs/0807.4620 (2008).

Christoph Koch, Dan Olteanu: Conditioning Probabilistic Databases. CoRR abs/0803.2212 (2008).

Oliver Kennedy, Christoph Koch, Alan J. Demers: Dynamic Approaches to In-Network Aggregation. CoRR abs/0810.3227 (2008).

2007

Biswanath Panda, Mirek Riedewald, Johannes Gehrke, Stephen B. Pope: High-Speed Function Approximation. ICDM 2007: 613-618.

Alan J. Demers, Johannes Gehrke, Biswanath Panda, Mirek Riedewald, Varun Sharma, Walker M. White: Cayuga: A General Purpose Event Monitoring System. CIDR 2007: 412-422.

David J. Martin, Daniel Kifer, Ashwin Machanavajjhala, Johannes Gehrke, Joseph Y. Halpern: Worst-Case Background Knowledge for Privacy-Preserving Data Publishing. ICDE 2007: 126-135.

Lyublena Antova, Christoph Koch, Dan Olteanu: MayBMS: Managing Incomplete Information with Probabilistic World-Set Decompositions. ICDE 2007: 1479-1480.

Michael Schmidt, Stefanie Scherzinger, Christoph Koch: Combined Static and Dynamic Analysis for Effective Buffer Minimization in Streaming XQuery Evaluation. ICDE 2007: 236-245.

Lyublena Antova, Christoph Koch, Dan Olteanu: 10^(10^6) Worlds and Beyond: Efficient Representation and Processing of Incomplete Information. ICDE 2007: 606-615.

Lyublena Antova, Christoph Koch, Dan Olteanu: World-Set Decompositions: Expressiveness and Efficient Algorithms. ICDT 2007: 194-208.

Lucja Kot and Walker White: Characterization of the Interaction of XML Functional Dependencies with DTDs. ICDT 2007: 119-133.

Walker M. White, Mirek Riedewald, Johannes Gehrke, Alan J. Demers: What is "next" in event processing? PODS 2007: 263-272.

Lars Brenna, Alan J. Demers, Johannes Gehrke, Mingsheng Hong, Joel Ossher, Biswanath Panda, Mirek Riedewald, Mohit Thatte, Walker M. White: Cayuga: a high-performance event processing engine. SIGMOD Conference 2007: 1100-1102.

Nitin Gupta, Fan Yang, Alan J. Demers, Johannes Gehrke, Jayavel Shanmugasundaram: User-centric personalized extensibility for data-driven web applications. SIGMOD Conference 2007: 1125-1127.

Adina Crainiceanu, Prakash Linga, Ashwin Machanavajjhala, Johannes Gehrke, Jayavel Shanmugasundaram: P-ring: an efficient and robust P2P range index structure. SIGMOD Conference 2007: 223-234.

Walker M. White, Alan J. Demers, Christoph Koch, Johannes Gehrke, Rajmohan Rajagopalan: Scaling games to epic proportion. SIGMOD Conference 2007: 31-42.

Lyublena Antova, Christoph Koch, Dan Olteanu: From complete to incomplete information and back. SIGMOD Conference 2007: 713-724.

Mingsheng Hong, Alan J. Demers, Johannes Gehrke, Christoph Koch, Mirek Riedewald, Walker M. White: Massively multi-query join processing in publish/subscribe systems. SIGMOD Conference 2007: 761-772.

Christoph Koch, Stefanie Scherzinger, Michael Schmidt: The GCX System: Dynamic Buffer Minimization in Streaming XQuery Evaluation. VLDB 2007: 1378-1381.

Lyublena Antova, Christoph Koch, Dan Olteanu: Query language support for incomplete information in the MayBMS system. VLDB 2007: 1422-1425.

Fan Yang, Nitin Gupta, Nicholas Gerner, Xin Qi, Alan J. Demers, Johannes Gehrke, Jayavel Shanmugasundaram: A unified platform for data driven web applications with automatic client-server partitioning. WWW 2007: 341-350.

Zhiyuan Chen, Johannes Gehrke, Flip Korn, Nick Koudas, Jayavel Shanmugasundaram, Divesh Srivastava: Index structures for matching XML twigs using relational query processors. Data Knowl. Eng. 60(2): 283-302 (2007).

Martin Grohe, Christoph Koch, Nicole Schweikardt: Tight lower bounds for query processing on streaming and external memory data. Theor. Comput. Sci. 380(1-2): 199-217 (2007).

Niki Trigoni, Yong Yao, Alan J. Demers, Johannes Gehrke, Rajmohan Rajaraman: Wave scheduling and routing in sensor networks. TOSN 3(1): 2 (2007).

Michaela Goetz, Christoph Koch, Wim Martens: Efficient Algorithms for the Tree Homeomorphism Problem. DBPL 2007: 17-31.

Christoph Koch, Stefanie Scherzinger: Attribute grammars for scalable query processing on XML streams. VLDB J. 16(3): 317-342 (2007).

Walker M. White, Christoph Koch, Nitin Gupta, Johannes Gehrke, Alan J. Demers: Database research opportunities in computer games. SIGMOD Record 36(3): 7-13 (2007).

Ashwin Machanavajjhala, Daniel Kifer, Johannes Gehrke, Muthuramakrishnan Venkitasubramaniam: L-diversity: Privacy beyond k-anonymity. TKDD 1(1): (2007).

David J. Martin, Daniel Kifer, Ashwin Machanavajjhala, Johannes Gehrke, Joseph Y. Halpern: Worst-Case Background Knowledge for Privacy-Preserving Data Publishing. CoRR abs/0705.2787: (2007).

Daria Sorokina, Johannes Gehrke, Simeon Warner, Paul Ginsparg: Plagiarism Detection in arXiv. CoRR abs/cs/0702012: (2007).

Lyublena Antova, Thomas Jansen, Christoph Koch, Dan Olteanu: Fast and Simple Relational Processing of Uncertain Data. CoRR abs/0707.1644 (2007).

Lyublena Antova, Christoph Koch, Dan Olteanu: World-set Decompositions: Expressiveness and Efficient Algorithms. CoRR abs/0705.4442 (2007).

2006

Ashwin Kumar V Machanavajjhala and Johannes Gehrke. On the Efficiency of Checking Perfect Privacy. In Proceedings of the 25th ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS 2006).

Daniel Kifer and J. E. Gehrke. Injecting Utility into Anonymized Datasets . In Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data (SIGMOD 2006).

William Y. Arms, Selcuk Aya, Manuel Calimlim, Jim Cordes, Julia Deneva, Pavel Dmitriev, Johannes Gehrke, Lawrence Gibbons, Christopher D. Jones, Valentin Kuznetsov, Dave Lifka, Mirek Riedewald, Dan Riley, Anders Ryd, and Gregory J. Sharp. Three Case Studies of Large-Scale Data Flows . In Proceedings of the IEEE Workshop on Workflow and Data Flow for Scientific Applications (SciFlow 2006). Atlanta Georgia, April 2006.

Ashwin Machanavajjhala, Johannes Gehrke, Daniel Kifer, and Muthuramakrishnan Venkitasubramaniam. l-Diversity: Privacy Beyond k-Anonymity . In Proceedings of the 22nd IEEE International Conference on Data Engineering (ICDE 2006). Atlanta Georgia, April 2006.

Jayavel Shanmugasundaram, Fan Yang, Mirek Riedewald, Johannes Gehrke, and Alan Demers. Hilda: A High-Level Language for Data-Driven Web Applications. In Proceedings of the 22nd IEEE International Conference on Data Engineering (ICDE 2006), Atlanta Georgia, April 2006.

Alan Demers, Johannes Gehrke, Mingsheng Hong, Mirek Riedewald, and Walker White. Towards Expressive Publish/Subscribe Systems . In Proceedings of the 10th International Conference on Extending Database Technology (EDBT 2006), Munich, Germany, March 2006.

Chavdar Botev, Sihem Amer-Yahia, and Jayavel Shanmugasundaram. Expressiveness and Performance of Full-Text Search Languages. In Proceedings of the 10th International Conference on Extending Database Technology (EDBT 2006), Munich, Germany, March 2006. 

2005

Lin Guo, Jayavel Shanmugasundaram, Kevin Beyer, Eugene Shekita, "Efficient Inverted Lists and Query Algorithms for Structured Value Ranking in Update-Intensive Relational Databases", In Proceedings of the IEEE International Conference on Data Engineering (ICDE) , Tokyo, Japan, April 2005.

Feng Shao, Antal Novak, Jayavel Shanmugasundaram, "Triggers over XML Views of Relational Data", In Proceedings of the IEEE International Conference on Data Engineering (ICDE) (poster) , Tokyo, Japan, April 2005.

2004

Abhinandan Das , Johannes Gehrke, and Mirek Riedewald . Approximation Techniques for Spatial Data . In Proceedings of the 2004 ACM SIGMOD International Conference on Management of Data (SIGMOD 2004) . Paris, France, June 2004.

Sihem Amer-Yahia, Chavdar Botev, and Jayavel Shanmugasundaram. TeXQuery: A Full-Text Search Extension to XQuery .

2003

Lin Guo , Feng Shao, Chavdar Botev, and Jayavel Shanmugasundaram XRANK: Ranked Keyword Search over XML Documents . In Proceedings of the the 2003 ACM SIGMOD International Conference on Management of Data (SIGMOD 2003). San Diego, CA, June 2003.

Abhinandan Das , J. E.  Gehrke, and Mirek Riedewald . Approximate Join Processing Over Data Streams . In Proceedings of the the 2003 ACM SIGMOD International Conference on Management of Data (SIGMOD 2003). San Diego, CA, June 2003.

Daniel Kifer , J. E. Gehrke, Cristian Bucila , and Walker White. How to Quickly Find a Witness . In Proceedings of the 22nd ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS 2003).  San Diego, CA, June 2003.

Alexandre Evfimievski , J. E. Gehrke, and Ramakrishnan Srikant. Limiting Privacy Breaches in Privacy Preserving Data Mining . In Proceedings of the 22nd ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS 2003).  San Diego, CA, June 2003.

2002

Cristian Bucila, J. E. Gehrke, Daniel Kifer, and Walker White. DualMiner: A Dual-Pruning Algorithm for Itemsets with Constraints. In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining . Edmonton, Alberta, Canada, July 2002.

Alexandre Evfimievski, Ramakrishnan Srikant, Rakesh Agrawal, and J. E. Gehrke. Privacy Preserving Mining of Association Rules . In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining . Edmonton, Alberta, Canada, July 2002.

Shai Ben-David, J. E. Gehrke, and Reba Schuller. A Theoretical Framework for Learning from a Pool of Disparate Data Sources . In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining . Edmonton, Alberta, Canada, July 2002.

Jay Ayres, J. E. Gehrke, Tomi Yiu, and Jason Flannick. Sequential Pattern Mining Using Bitmaps . In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining . Edmonton, Alberta, Canada, July 2002.

Alin Dobra and Johannes Gehrke. SECRET: A Scalable Linear Regression Tree Algorithm . In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining . Edmonton, Alberta, Canada, July 2002.

Dobra, M. Garofalakis, J. E. Gehrke, and R. Rastogi. Processing Complex Aggregate Queries over Data Streams , In Proceedings of the 2002 ACM Sigmod International Conference on Management of Data , Madison, Wisconsin, June 2002.

Tatarinov, E. Viglas, K. Beyer, J. Shanmugasundaram, E. Shekita, "Storing and Querying Ordered XML Using a Relational Database System", In Proceedings of the 2002 ACM Sigmod International Conference on Management of Data , Madison, Wisconsin, June 2002.

F. Chu, J. Halpern, and J. E. Gehrke. Least Expected Cost Query Optimization: What Can We Expect? In Proceedings of the 21st ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems (PODS 2002) . Madison, Wisconsin, June 2002.

Anton Faradjian, J. E. Gehrke, and Philippe Bonnet. GADT: A Probability Space ADT For Representing and Querying the Physical World. In Proceedings of the 18th International Conference on Data Engineering (ICDE 2002) , San Jose, California, February 2002.

2001

J. Shanmugasundaram, E. Shekita, R. Barr, M. Carey, B. Lindsay, H. Pirahesh, B. Reinwald, " Efficiently Publishing Relational Data as XML Documents ", VLDB Journal . An earlier version appeared in the VLDB 2000 conference.

J. Shanmugasundaram, J. Kiernan, E. Shekita, C. Fan, J. Funderburk, " Querying XML Views of Relational Data ", In Proceedings of the VLDB Conference, Rome, Italy, September 2001.

J. Shanmugasundaram, E. Shekita, J. Kiernan, R. Krishnamurthy, E. Viglas, J. Naughton, I. Tatarinov, " A General Technique for Querying XML Documents using a Relational Database System ," SIGMOD Record , September 2001.

Alin Dobra and J. E. Gehrke. Bias Correction in Classification Tree Construction. In Proceedings of the Seventeenth International Conference on Machine Learning (ICML 2001) , Williams College, Massachusetts, June 2001.

Zhiyuan Chen, J. E. Gehrke, and Flip Korn. Query Optimization In Compressed Database Systems. In Proceedings of the 2001 ACM Sigmod International Conference on Management of Data , Santa Barbara, California, May 2001.

J. E. Gehrke, Flip Korn, and Divesh Srivastava . On Computing Correlated Aggregates Over Continual Data Streams. In Proceedings of the 2001 ACM Sigmod International Conference on Management of Data , Santa Barbara, California, May 2001.

Doug Burdick, Manuel Calimlim, and J. E. Gehrke. MAFIA: A Maximal Frequent Itemset Algorithm for Transactional Databases . In Proceedings of the 17th International Conference on Data Engineering , Heidelberg, Germany, April 2001.

J. Shanmugasundaram, K. Tufte, D. DeWitt, J. Naughton, D. Maier, "Architecting a Network Query Engine for Producing Partial Results", Lecture Notes in Computer Science , Vol. 1997, Springer-Verlag Publishers, 2001. An earlier version appeared in the WebDB 2000 workshop.

Philippe Bonnet, J. E. Gehrke, and Praveen Seshadri. Towards Sensor Database Systems . In Proceedings of the Second International Conference on Mobile Data Management . Hong Kong, January 2001.

2000

Philippe Bonnet, J. E. Gehrke, and Praveen Seshadri. Querying the Physical World . IEEE Personal Communications, Vol. 7, No. 5, October 2000, pages 10-15. Special Issue on Smart Spaces and Environments.

J. Shanmugasundaram, E. Shekita, R. Barr, M. Carey, B. Lindsay, H. Pirahesh, B. Reinwald, Efficiently Publishing Relational Data as XML Documents, In Proceedings of the VLDB Conference, Cairo, Egypt, September 2000.

J. E. Gehrke, Raghu Ramakrishnan, and Venkatesh Ganti. RAINFOREST - A Framework for Fast Decision Tree Construction of Large Datasets. In Data Mining and Knowledge Discovery, Volume 4, Issue 2/3, July 2000 , pages 127-162.

Venkatesh Ganti, J. E. Gehrke, and Raghu Ramakrishnan . DEMON: Mining and Monitoring Evolving Data . In Proceedings of the 16th International Conference on Data Engineering , San Diego, California, February 2000. Best student paper award.

Zhiyuan Chen and Praveen Seshadri: An Algebraic Compression Framework for Query Results. In Proceedings of the 16th International Conference on Data Engineering , San Diego, California, February 2000, pages 177-188.

Philippe Bonnet, Praveen Seshadri : Device Database Systems. In Proceedings of the 16th International Conference on Data Engineering , San Diego, California, February 2000.

1999

J. Shanmugasundaram, K. Tufte, G. He, C. Zhang, D. DeWitt, J. Naughton, " Relational Databases for Querying XML Documents: Limitations and Opportunities ," In Proceedings of the VLDB Conference, Edinburgh, Scotland, September 1999.

Venkatesh Ganti, J. E. Gehrke, and Raghu Ramakrishnan. Mining very large databases. IEEE Computer, Vol. 32, No. 9,  August 1999 , pages 38-45.

J. Shanmugasundaram, U. Fayyad, P. Bradley, " Compressed Data Cubes for OLAP Aggregate Query Approximation on Continuous Dimensions ", In Proceedings of the 1999 SIGKDD Conference, San Diego, California, August 1999.

Venkatesh Ganti, J. E. Gehrke, and Raghu Ramakrishnan . CACTUS--Clustering Categorical Data Using Summaries . In Proceedings of the 1999 SIGKDD Conference , San Diego, California, August 1999.

J. Shanmugasundaram, A. Nithrakashyap, R. Sivasankaran, K. Ramamritham, "Efficient Concurrency Control for Broadcast Environments", In Proceedings of the 1999 SIGMOD Conference, Philadelphia, Pennsylvania, June 1999.

J. E. Gehrke, Venkatesh Ganti, Raghu Ramakrishnan , and Wei-Yin Loh. BOAT -- Optimistic Decision Tree Construction . In Proceedings of the 1999 SIGMOD Conference , Philadelphia, Pennsylvania, June 1999.

Tobias Mayr and Praveen Seshadri: Client-Site Query Extensions. In Proceedings of the 1999 SIGMOD Conference , Philadelphia, Pennsylvania, June 1999, pages 347-358.

Philippe Bonnet, Kyle Buza , Zhiyuan Chen , Victor Cheng , Randolph Chung , Takako M. Hickey , Ryan Kennedy , Daniel Mahashin , Tobias Mayr , Ivan Oprencak , Praveen Seshadri and Hubert Siu : The Cornell Jaguar System: Adding Mobility to PREDATOR. In Proceedings of the 1999 SIGMOD Conference , Philadelphia, Pennsylvania, June 1999, pages 580-581.

Venkatesh Ganti, J. E. Gehrke, Raghu Ramakrishnan , and Wei-Yin Loh. A Framework for Measuring Changes in Data Characteristics . In Proceedings of the Eighteenth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems , Philadelphia, Pennsylvania, May 1999. (Invited to Journal of Computer Science and Systems (JCSS).)

Francis Chu , Joseph Y. Halpern , and Praveen Seshadri: Least Expected Cost Query Optimization: An Exercise in Utility. In Proceedings of the Eighteenth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems , Philadelphia, Pennsylvania, May 1999, pages 138-147.

Venkatesh Ganti, Raghu Ramakrishnan, J. E. Gehrke, Allison L. Powell, and James French. Clustering Large Datasets in Arbitrary Metric Spaces . In Proceedings of the Fifteenth International Conference on Data Engineering , Sidney, Australia, 1999.

1998

N. Gehani, K. Ramamritham, J. Shanmugasundaram, O. Shmueli, " Accessing Extra-Database Information: Concurrency Control and Correctness ", Information Systems: An International Journal, 23(7), pp. 439-462, 1998.

Praveen Seshadri: Enhanced Abstract Data Types in Object-Relational Databases. VLDB Journal 7 (3): 130-140 (1998).

J. E. Gehrke, Raghu Ramakrishnan, and Venkatesh Ganti. RAINFOREST - A Framework for Fast Decision Tree Construction of Large Datasets . In Proceedings of the Twenty-fourth International Conference on Very Large Data Bases , New York, New York, 1998.

Rakesh Agrawal, J. E. Gehrke, Dimitrios Gunopulos, and Prabhakar Raghavan . Automatic Subspace Clustering of High Dimensional Data for Data Mining Applications . In Proceedings of the 1998 SIGMOD Conference, Seattle, Washington, June 1998.

Michael Godfrey , Tobias Mayr , Praveen Seshadri, and Thorsten von Eicken : Secure and Portable Database Extensibility. In Proceedings of the 1998 SIGMOD Conference , Seattle, Washington, June 1998, pages 390-401.

Praveen Seshadri: Predator: A Resource for Database Research. SIGMOD Record 27 (1): 16-20 (1998).

1997

Michael J. Carey, David J. DeWitt, Jeffrey F. Naughton , Mohammad Asgarian, J.E. Gehrke, and Dhaval N. Shah. The BUCKY Object-Relational Benchmark . In Proceedings of the 1997 SIGMOD Conference , Tucson, Arizona, May 1997. More material , including the data generator used in the benchmark.