Database Management Systems

Where is the Life we have lost in living?
Where is the wisdom we have lost in knowledge?
Where is the knowledge we have lost in information? -- T.S. Elliot

Logistics

Overview

The course has two parts. The first part (before spring break) covers a basic background on system abstractions and algorithmic tools for managing large datasets. The second part (after spring break) covers current research topics in this area such as the management of structured and unstructured data, search, and data mining. The course prerequisites are algorithms and probability (at the level of CS482) and a basic background in systems (at the level of CS414).

The main work for the course consists of a paper summary for each class (the 20 best paper summaries will count in total for 20% of your grade), short answers to questions about the papers (the 20 best answers will count in total for 20% of your grade), the presentation of one paper in the course (10% of your grade), and a course project (50% of your grade).

In the project, I expect you to try to do original research. The project encompasses the following steps:

Course Outline (Draft)

Note: Write your summary about the paper marked with a (*).

Part I: The Classics

Data Models

January 25

January 30:

Query Optimization and Query Processing

February 1

February 6

February 8

Access Methods

February 13

Storage

February 15

One-Pass Query Processing and Sampling

February 20

February 22

Transaction Management

February 27

March 1

March 6

A Detour: Web Services and Databases

March 8

Transaction Management (Contd.)

March 13

Transaction Management (Contd.)

March 15

 

Part II: The Nouveaux (After Spring Break)

Decision Support and OLAP

March 27 (Guest lecture by Alan Demers)

March 29 (Guest lecture by Mirek Riedewald)

Data Mining

April 3 (Lakshmi)

April 5 (Yunsong Guo)

Deductive Database Systems

April 10 and April 12 (Parvati will give one lecture)

Benchmarking

April 17 (Chi Ho)

Data Stream Processing

April 19 (Philipp Unterbrunner)

April 24. (Adam Arbree)

April 26 (Guest lecture by Mingsheng Hong)

Main memory database systems (Last week of classes: No more summaries needed, work on your projects instead.)

May 1 (Nitin)

Database Privacy

May 3