 |
CS 430
Information Discovery
Spring 2001
Syllabus |
This preliminary syllabus can be expected to change as the course progresses.
Class Formats
Classes are divided into two formats. Nomadic laptop computers can be used in all of
them, but some guest speakers may prefer that they not be used.
- Lectures
- Each week there will be one or two conventional lectures in usual format.
PowerPoint slides will usually be available on the web for access from laptop computers.
-
- Discussions
- Class discussions on Wednesday evenings will be around readings from the
text book or other papers to be read before
the class. Details may be checked online during class, but it is essential that that
everybody comes to class well prepared.
Week 1: Introduction to information retrieval
Date |
Event |
Topic |
Tuesday 1/23 |
Lecture 1 |
Overview of information discovery |
Wednesday 1/24 |
Discussion 1 |
Introduction to information retrieval [reading] |
Thursday 1/25 |
Lecture 2 |
Basic concepts of information retrieval |
Week 2: File structures
Date |
Event |
Topic |
Tuesday 1/30 |
Lecture 3 |
Inverted files |
Wednesday 1/31 |
Discussion 2 |
Inverted files [reading] |
Thursday 2/1 |
Lecture 4 |
File structures for inverted files |
Week 3: Descriptive metadata 1
Date |
Event |
Topic |
Tuesday 2/6 |
Lecture 5 |
Descriptive metadata1: Library catalogs, Dublin Core |
Wednesday 2/7 |
Discussion 3 |
Dublin Core [readings] |
Thursday 2/8 |
Lecture 6 |
MARC cataloguing |
Friday 2/9 |
Assignment 1 due |
Basic techniques of information retrieval |
Week 4: Descriptive
metadata 2 / Automatic indexing 1
Date |
Event |
Topic |
Tuesday 2/13 |
Lecture 7 |
Automatic extraction of catalog records |
Wednesday 2/14 |
Discussion 4 |
Lexical analysis and stop lists [reading] |
Thursday 2/15 |
Lecture 8 |
Automatic term extraction and weighting |
Week 5: Automatic indexing 2
Date |
Event |
Topic |
Tuesday 2/20 |
Lecture 9 |
Vector methods |
Wednesday 2/21 |
[No discussion class] |
|
Thursday 2/22 |
Lecture 10 |
Guest, Kamen Yotov. Web crawlers |
Week 6: Retrieval evaluation
Date |
Event |
Topic |
Tuesday 2/27 |
Lecture 11 |
Cranfield and TREC |
Wednesday 2/28 |
Discussion 5 |
Stemming algorithms [reading] |
Thursday 3/1 |
Lecture 12 |
Evaluation of retrieval effectiveness |
Friday 3/2 |
Assignment 2 due |
Descriptive metadata |
Week 7: Thesauruses 1
Date |
Event |
Topic |
Tuesday 3/6 |
Lecture 13 |
Examples of thesauruses |
Wednesday 3/7 |
Mid-term examination |
|
Thursday 3/8 |
Lecture 14 |
Guest, Carl Lagoze. Distributed
information discovery |
Week 8: Thesauruses 2
Date |
Event |
Topic |
Tuesday 3/13 |
Lecture 15 |
Guest, Kamen Yotov. Google: Case study |
Wednesday 3/14 |
Discussion 6 |
Thesaurus constructon [reading] |
Thursday 3/17 |
Lecture 16 |
Thesaurus construction |
Mid-term break
Week 9: Ranking algorithms
Date |
Event |
Topic |
Tuesday 3/27 |
Lecture 17 |
Ranking methods 1 |
Wednesday 3/28 |
Discussion 7 |
Ranking methods [reading] |
Thursday 3/29 |
Lecture 18 |
Ranking methods 2 |
Week 10: User interfaces
Date |
Event |
Topic |
Tuesday 4/3 |
Lecture 19 |
User interfaces for information discovery |
Wednesday 4/4 |
Discussion 8 |
A user interface for text searches [reading] |
Thursday 4/5 |
Lecture 20 |
The user in the loop |
Friday 4/6 |
Assignment 3 due |
Textual analysis and automatic indexing |
Week 11: User interfaces / query modification
Date |
Event |
Topic |
Tuesday 4/10 |
Lecture 21 |
Interactive retrieval |
Wednesday 4/11 |
Discussion 9 |
Relevance feedback and other query modification
techniques [reading] |
Thursday 4/12 |
Lecture 22 |
Non-textual materials 1 |
Week 12: Beyond text / web search systems
Date |
Event |
Topic |
Tuesday 4/17 |
Lecture 23 |
Non-textual materials 2 |
Wednesday 4/18 |
Discussion 10 |
Google [reading] |
Thursday 4/19 |
Lecture 24 |
[Cancelled] |
Week 13: Beyond text
Date |
Event |
Topic |
Tuesday 4/24 |
Lecture 25 |
Automatic indexing and metadata-based retrieval |
Wednesday 4/25 |
Discussion 11 |
Informedia [reading] |
Thursday 4/26 |
Lecture 26 |
Extending the Booelan model |
Friday 4/27 |
Assignment 4 due |
Fielded retrieval |
Week 14: Clustering
algorithms
Date |
Event |
Topic |
Tuesday 5/1 |
Lecture 27 |
Large-scale information discovery: the NSDL |
Wednesday 5/2 |
Discussion 12 |
Clustering algorithms [reading] |
Thursday 5/3 |
Lecture 28 |
Clustering and automatic classification |
Examination weeks
Date |
Event |
Thursday 5/10 |
Early examination, Upson 5130 1:00 to 3:00 p.m. |
Tuesday 5/15 |
Final examination, Kimball Hall B11 3:00 to 5:00 p.m. |
[CS 430 Home Page]
William Y. Arms
(wya@cs.cornell.edu)
Last changed: May 1, 2001