Assignments

  Materials Due Date
Critique 1 J. Zobel. How reliable are the results of large-scale information retrieval experiments? SIGIR 1998.

Critique guidelines

Mon 9/16
Assignment 1 pdf, html (last modified Mon 9/16)

Lemur IR toolkit

Successful Lemur installations reported by cs574 students thus far:
Lemur-1.9.2-winxpbin under Windows XP (no problems)
Lemur 1.1 using Cygwin 1.3.12-2 under Windows XP (but problems with 1.9 and 1.9.2)

Mon 9/30
Critique 2 Dumais et al. Web Question Answering: Is More Always Better? SIGIR 2002. Wed 10/2
Prelim 1 The in-class prelim will test any material covered in class, in the readings (excluding the ones labeled "background" or "extra"), and in the critiqued papers. Mon 10/7
Assignment 2 Building a simple QA system
questions.txt
answers.txt
Mon 10/28
Critique 3 Blum & Mitchell, Combining Labeled and Unlabeled Data with Co-Training, Proceedings of the 1998 Conference on Computational Learning Theory, July 1998. Wed 10/30
Assignment 3 Text Classification

arxiv_doc.train.gz, arxiv_classes.train.gz
arxiv_doc.test.gz, arxiv_classes.test.gz

SVM-light, KNN

Wed 11/13
Critique 4 Thelen and Riloff, A Bootstrapping Method for Learning Semantic Lexicons using Extraction Pattern Contexts, Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing, EMNLP 2002. Wed 12/4