General resources
Homepage for my previous offering of this course (Fall 07)
Reference texts
- Ricardo Baeza-Yates and Berthier Ribeiro-Neto, Modern Information Retrieval (1999). US mirror.
- Rik Belew, Finding Out About: A Cognitive Perspective on Search Engine Technology and the WWW (2000).
- Eugene Charniak, Statistical Language Learning (1993).
- Bruce Croft, Don Metzler, and Trevor Strohman, Search Engines: Information Retrieval in Practice (2010). Some chapters available online.
- W. Bruce Croft and John Lafferty, editors, Language Modeling for Information Retrieval (2003).
- William B. Frakes and Ricardo Baeza-Yates, Information Retrieval: Data Structures & Algorithms (1992).
- Daniel Jurafsky and James H. Martin, Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, 2nd edition (2008).
- Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze, Introduction to Information Retrieval (2008).
- Christopher D. Manning and Hinrich Schütze, Foundations of Statistical Natural Language Processing (1999). Completely online via Cornell.
- Fernando C. N. Pereira and Stuart M. Shieber, Prolog and Natural-Language Analysis (1987).
- C. J. van Rijsbergen, Information Retrieval, second edition (1979). Alternate version without indexing or page divisions.
- Karen Spärck Jones and Peter Willett, Readings in Information Retrieval (1997).
Pointers to papers
Alistair Moffat, Justin Zobel, and David Hawking, Recommended reading for IR research students, SIGIR Forum 39(2):3–14, 2005. [pdf, pdf2]
Cornell IR/NLP courses
Datasets and software: