CS 430
Information Discovery
Fall 2002

Syllabus


This preliminary syllabus can be expected to change as the course progresses.

Class Formats

Classes are divided into two formats. 

Lectures
Each week there will be two lectures in the conventional format.  PowerPoint slides will be available on the web.
 
Discussions
Class discussions on Wednesday evenings will center on readings from the textbook or other papers, which must be read before the class.  It is essential that everybody comes to class well prepared.

Week 1

Date Event Topic
Thursday 8/29 Lecture 1 Overview of information discovery
[PowerPoint, HTML]

Week 2

Date Event Topic
Tuesday 9/3 Lecture 2 Text based information retrieval
[PowerPoint, HTML]
Wednesday 9/4 Discussion 1 Inverted files [reading]
[PowerPoint, HTML]
Thursday 9/5 Lecture 3 Inverted files
[PowerPoint, HTML]

Week 3

Date Event Topic
Tuesday 9/10 Lecture 4 Vector methods
[PowerPoint, HTML]
Wednesday 9/11 Discussion 2 Lexical analysis and stoplists [reading]
[PowerPoint, HTML]
Thursday 9/12 Lecture 5 Ranking
[PowerPoint, HTML]

Week 4

Date Event Topic
Tuesday 9/17 Lecture 6 Data structures for text processing
[PowerPoint, HTML]
Wednesday 9/18 Discussion 3 Stemming [reading]
[PowerPoint, HTML]
Thursday 9/19 Lecture 7 Evaluation of retrieval effectiveness: Cranfield and TREC
[PowerPoint, HTML]
Friday 9/20 Assignment 1 due  

Week 5

Date Event Topic
Tuesday 9/24 Lecture 8 Evaluation of retrieval effectiveness II
[PowerPoint, HTML]
Wednesday 9/25 Discussion 4 Ranking methods [reading]
[PowerPoint, HTML]
Thursday 9/26 Lecture 9 Boolean methods
[PowerPoint, HTML]

Week 6

Date Event Topic
Tuesday 10/1 Lecture 10 Probabilistic information retrieval
[PowerPoint, HTML]
Wednesday 10/2 Discussion 5 Latent semantic indexing [reading]
[PowerPoint, HTML]
Thursday 10/3 Lecture 11 Latent semantic indexing
[PowerPoint, HTML]

Week 7

Date Event Topic
Tuesday 10/8 Lecture 12 Library catalogs, MARC cataloguing
[PowerPoint, HTML]
Wednesday 10/9 Discussion 6 Dublin Core [reading]
[PowerPoint, HTML]
Thursday 10/10 Lecture 13 Dublin Core:  Automatic extraction of catalog records
[PowerPoint, HTML]

Week 8

Date Event Topic
Tuesday 10/15 [break]
Wednesday 10/16 Discussion 7 User interfaces [reading]
[PowerPoint, HTML]
Thursday 10/17 Lecture 14 Usability 1
[PowerPoint, HTML]

Week 9

Date Event Topic
Monday 10/21 Assignment 2 due
Tuesday 10/22 Lecture 15 Guest lecture: Carl Lagoze, Distributed information retrieval
[PowerPoint, HTML]
Wednesday 10/23 Midterm examination
Thursday 10/24 Lecture 16 Usability 2
[PowerPoint, HTML]

Week 10

Date Event Topic
Tuesday 10/29 Lecture 17 Web crawlers
[PowerPoint, HTML]
Wednesday 10/30 Discussion 8 Google [reading]
[PowerPoint, HTML]
Thursday 10/31 Lecture 18 Web search systems
[PowerPoint, HTML]

Week 11

Date Event Topic
Tuesday 11/5 Lecture 19 Non-textual materials 1
[PowerPoint, HTML]
Wednesday 11/6 Discussion 9 Informedia [reading]
[PowerPoint, HTML]
Thursday 11/7 Lecture 20 Non-textual materials 2
[PowerPoint, HTML]

Week 12

Date Event Topic
Monday 11/11 Assignment 3 due
Tuesday 11/12 Lecture 21 Thesaurus examples
[PowerPoint, HTML]
Wednesday 11/13 Discussion 10 Thesaurus construction [reading]
[PowerPoint, HTML]
Thursday 11/14 Lecture 22 Cluster analysis 1
[PowerPoint, HTML]

Week 13

Date Event Topic
Tuesday 11/19 Lecture 23 Cluster analysis 2 and thesaurus construction
[PowerPoint, HTML]
Wednesday 11/20 Discussion 11 Clustering [reading]
[PowerPoint, HTML]
Thursday 11/21 Lecture 24 Guest lecture: Thorsten Joachims, Machine learning
[PDF]

Week 14

Date Event Topic
Tuesday 11/26 Lecture 25 Query refinement
[PowerPoint, HTML]
Wednesday 11/27 [break]
Thursday 11/28 [break]

Week 15

Date Event Topic
Tuesday 12/3 [no class]
Wednesday 12/4 Discussion 12 User interface concepts
[PowerPoint, HTML]
Thursday 12/5 Lecture 26 Architecture of information retrieval systems
[PowerPoint, HTML]
Friday 12/6 Assignment 4 due  

Examinations

Date Event
Thursday 12/12 Early examination, Information Science Building, noon to 1:30
Wednesday 12/18 Final examination, Olin 165, noon to 1:30

William Y. Arms
(wya@cs.cornell.edu)
Last changed: November 30, 2002