CS 430
Information Discovery
Spring 2001

Syllabus

This preliminary syllabus can be expected to change as the course progresses.

Class Formats

Classes are divided into two formats. Nomadic laptop computers can be used in all of them, but some guest speakers may prefer that they not be used.

Lectures
Each week there will be one or two conventional lectures in usual format.  PowerPoint slides will usually be available on the web for access from laptop computers.
 
Discussions
Class discussions on Wednesday evenings will be around readings from the text book or other papers to be read before the class.  Details may be checked online during class, but it is essential that that everybody comes to class well prepared.

Week 1:  Introduction to information retrieval

Date Event Topic
Tuesday 1/23 Lecture 1 Overview of information discovery
Wednesday 1/24 Discussion 1 Introduction to information retrieval [reading]
Thursday 1/25 Lecture 2 Basic concepts of information retrieval

Week 2:  File structures

Date Event Topic
Tuesday 1/30 Lecture 3 Inverted files
Wednesday 1/31 Discussion 2 Inverted files [reading]
Thursday 2/1 Lecture 4 File structures for inverted files

Week 3: Descriptive metadata 1

Date Event Topic
Tuesday 2/6 Lecture 5 Descriptive metadata1: Library catalogs, Dublin Core
Wednesday 2/7 Discussion 3 Dublin Core [readings]
Thursday 2/8 Lecture 6 MARC cataloguing
Friday 2/9 Assignment 1 due Basic techniques of information retrieval

Week 4:  Descriptive metadata 2 / Automatic indexing 1

Date Event Topic
Tuesday 2/13 Lecture 7 Automatic extraction of catalog records
Wednesday 2/14 Discussion 4 Lexical analysis and stop lists [reading]
Thursday 2/15 Lecture 8 Automatic term extraction and weighting

Week 5: Automatic indexing 2

Date Event Topic
Tuesday 2/20 Lecture 9 Vector methods  
Wednesday 2/21 [No discussion class]  
Thursday 2/22 Lecture 10 Guest, Kamen Yotov.  Web crawlers 

Week 6: Retrieval evaluation

Date Event Topic
Tuesday 2/27 Lecture 11 Cranfield and TREC
Wednesday 2/28 Discussion 5 Stemming algorithms [reading]
Thursday 3/1 Lecture 12 Evaluation of retrieval effectiveness 
Friday 3/2 Assignment 2 due Descriptive metadata

Week 7: Thesauruses 1

Date Event Topic
Tuesday 3/6 Lecture 13 Examples of thesauruses
Wednesday 3/7 Mid-term examination  
Thursday 3/8 Lecture 14 Guest, Carl Lagoze.  Distributed information discovery

Week 8: Thesauruses 2

Date Event Topic
Tuesday 3/13 Lecture 15 Guest, Kamen Yotov.  Google: Case study
Wednesday 3/14 Discussion 6 Thesaurus constructon [reading]
Thursday 3/17 Lecture 16 Thesaurus construction

Mid-term break

Week 9: Ranking algorithms

Date Event Topic
Tuesday 3/27 Lecture 17 Ranking methods 1
Wednesday 3/28 Discussion 7 Ranking methods [reading]
Thursday 3/29 Lecture 18 Ranking methods 2

Week 10:  User interfaces

Date Event Topic
Tuesday 4/3 Lecture 19 User interfaces for information discovery
Wednesday 4/4 Discussion 8 A user interface for text searches [reading]
Thursday 4/5 Lecture 20 The user in the loop
Friday 4/6 Assignment 3 due Textual analysis and automatic indexing

Week 11: User interfaces / query modification

Date Event Topic
Tuesday 4/10 Lecture 21 Interactive retrieval
Wednesday 4/11 Discussion 9 Relevance feedback and other query modification techniques [reading]
Thursday 4/12 Lecture 22 Non-textual materials 1

Week 12: Beyond text / web search systems

Date Event Topic
Tuesday 4/17 Lecture 23 Non-textual materials 2
Wednesday 4/18 Discussion 10 Google [reading]
Thursday 4/19 Lecture 24 [Cancelled] 

Week 13: Beyond text

Date Event Topic
Tuesday 4/24 Lecture 25 Automatic indexing and metadata-based retrieval
Wednesday 4/25 Discussion 11 Informedia [reading]
Thursday 4/26 Lecture 26 Extending the Booelan model
Friday 4/27 Assignment 4 due Fielded retrieval

Week 14: Clustering algorithms

Date Event Topic
Tuesday 5/1 Lecture 27 Large-scale information discovery: the NSDL
Wednesday 5/2 Discussion 12 Clustering algorithms [reading]
Thursday 5/3 Lecture 28 Clustering and automatic classification

Examination weeks

Date Event
Thursday 5/10 Early examination, Upson 5130 1:00 to 3:00 p.m.
Tuesday 5/15 Final examination, Kimball Hall B11 3:00 to 5:00 p.m.

[CS 430 Home Page]

William Y. Arms
(wya@cs.cornell.edu)
Last changed: May 1, 2001