CIS 330: Applied Database Systems

Instructor: Johannes Gehrke http://www.cs.cornell.edu/johannes

Tuesdays & Thursdays 2:55-4:10pm

 

 

What is the technology behind ebay’s website?

 

How does Wal-Mart manage its 200 TB data warehouse?

 

How can we link together hundreds of biological databases

into one common virtual data repository?

 

How does Amazon recommend new books for you?

 

How can you mine nuggets of information in Wegman’s

transaction database?

 

 

This course introduces database systems with a focus on how to use them in practice. The course will give an overview of the capabilities of modern database systems, and how to build database-backed applications. Topics covered include the relational model, SQL, transactions, database design and tuning, three-tier architectures, web data management with XML, service-oriented architectures, data mining, and data warehousing. The course will include hands-on exercises using a commercial database system and a student-selected web-database project.  

 

Prerequisites: Basic knowledge of Java and some knowledge of data structures such as CS211 or equivalent.  Can be used towards the concentration/ minor in Information Science: www.cis.cornell.edu/infoscience

 

1x104 bytes

=

one page in an encyclopedia

5x106 bytes

=

complete works of Shakespeare

8x109 bytes

=

one DVD

1x1012 bytes

=

all X-ray films in a large hospital

4x1014 bytes

=

National Climatic Data Center database

2x1015 bytes

=

Google database

1.5x1018 bytes

=

data on 190,000 DVDs shipped daily by Netflix

5x1018 bytes

=

all words ever spoken by human beings since the beginning of time