CS 674 Natural Language Processing
Spring 2007

MW 11:15-12:05
Thurston 202

Mats Rooth
mr249@cornell.edu
Morrill 203A

Notices and description

There is a page with notices.

The course is a graduate-level introduction to NLP, covering computational morphology, syntactic formalisms and parsing, markup and evaluation methodology, statistical models, and laboratory methods. (Students who are uncertain whether they have enough computational and mathematical preparation should consult me; special arrangements can be made. Introduction to Computational Linguistics, which is offered in Fall 2007, is more suitable for students with little computational preparation.)

Textbooks

The 2nd edition of Jurafsky and Martin is not available yet. We will use preprints of chapters from the new edition together with chapters from the 1st edition ( draft chapters ).

Course requirements

Topics

  1. Morphology and finite state transducers
  2. Tree syntax of natural language; Context free and other tree grammars
  3. Parsing tree syntax
  4. N-grams
  5. Word classes and tagging
  6. Probabilistic tree models
  7. Computational morphology/phonology
  8. Sequence models
  9. Features and unification
  10. Semantics
  11. Computational lexical semantics

Organization

Assignments and some materials will be posted and submitted through CMS.

Academic Integrity

Your conduct in the course is governed by CS and Cornell policies on academic integrity.

These particular policies apply:

  1. Except for lab problems, assignments must be your own work. This does not exclude discussion of points of interpretation or general strategy.
  2. Work on lab problems may be joint; submit your own result and credit the people you worked with.
  3. Any resources (e.g. publications and NLP toolkits) used in assignments, projects, or presentations must be credited.