Structured Prediction for Natural Language Processing

Fall 2015, CS 6741
Time: Monday and Wednesday, 2:55pm-4:10pm
Room: Gates 405 (Ithaca) / Touchdown (Tech)
Instructor: Yoav Artzi
Contact: Piazza Discussion Group [join here]
Office hours: by appointment (contact instructor)

Robust language understanding has the potential to transform how we interact with computers, extract information from text and study language on a large scale. However, to accurately recover the meaning of language, automated systems must learn to reason about the meaning of words and the intricate structures they combine to form. This research-oriented course examines machine learning and inference methods for recovering structured representations of language meaning. Possible topics include formalisms, inference and learning for: sequence models (tagging, named-entity recognition), tree models (constituency and dependency parsing), mapping sentences to logical form representations, and alignment models (machine translation).

Choosing among NLP courses: How do I know which one is right for me?

In 2015-2016, we are blessed with a plethora of NLP-related offerings! At the graduate level:

If you are interested in extracting information and meaning from text through machine learning techniques, then consider taking CS6740/IS6300, Advanced Language Technologies (offered Spring 2016. To get a feel for what the course will be like, see the Fall 2012 offering).
If you are interested in studying formal representation of language meaning, and designing algorithms to learn to map sentences to such representations, then consider taking CS6741, Structured Prediction for Natural Language Processing (offered Fall 2015).
If you are interested in exploring the social aspects of language and its role in online interactions, then consider taking CS6742, Natural Language Processing and Social Interaction (offered Fall 2015. To get a feel for what the course will be like, see the Fall 2014 offering).

All three courses fulfill the same CS graduate course requirements. If you are truly passionate about NLP research, we would love to see you in all of these courses. For undergraduate courses on offer, consult the Cornell NLP course list.

Lectures

Schedule and the details of the topics covered are subject to change.

Resources

Michael Collins, Notes on Statistical NLP (on Michael's website)
Yoav Artzi, Semantic Parsing with CCG (tutorial)

Textbooks

Recommended: D. Jurafsky & J.H. Martin, Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics and Speech Recognition, Prentice Hall, Second Edition, 2009 (J&M)
Optional: N.A. Smith, Linguistic Structure Prediction, Morgan and Claypool, 2011 (Smith)
Optional: C.D. Manning & H. Schuetze, Foundations of Statistical Natural Language Processing, Cambridge: MIT Press, 1999 (M&S)

Course Procedures: Requirements, Grading, and Policies

Requirements

The course will include an assignment, a class project, and, potentially, a paper presentation. There are two options for the course project: original research or paper re-implementation. The goal of original research is to produce an ACL-level research paper (ACL/NAACL/EMNLP/TACL are the top-tier venues for NLP research). Although publication is not a condition for success, the research question, evaluation and analysis must be at the level of an ACL paper. Negative results will be accepted. The goal of paper re-implementation is to take an existing well-written and influential paper, re-implement it and study the approach, including re-creating the results, conducting further experiments and releasing a well-documented implementation (releasing is highly encouraged, but is not mandatory). The set of papers available for re-implementation will be posted at the beginning of the semester.

Grading

Your grade will be determined by the assignment (20%), participation (10%), and the class project (70%). We may add an in-class paper presentation, which would be graded (10%); in that case the class project will count for 60% of the grade.
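As a rough illustration of the arithmetic, the two weighting schemes can be sketched as below. This is only a sketch: the function name final_grade, the 0-100 score scale, and the example scores are assumptions made for illustration and are not part of the course policy.

```python
def final_grade(assignment, participation, project, presentation=None):
    """Weighted course grade, assuming each component is scored from 0 to 100.

    Hypothetical helper for illustration only. Weights are given in
    whole percentage points, following the breakdown stated above.
    """
    if presentation is None:
        # Default scheme: assignment 20%, participation 10%, project 70%.
        weighted = 20 * assignment + 10 * participation + 70 * project
    else:
        # If a paper presentation is added: it counts for 10%,
        # and the project drops from 70% to 60%.
        weighted = (20 * assignment + 10 * participation
                    + 10 * presentation + 60 * project)
    return weighted / 100


# Made-up scores, only to show how the weights combine.
print(final_grade(80, 100, 90))
print(final_grade(80, 100, 90, presentation=70))
```

In both schemes the weights sum to 100%, so adding a presentation shifts weight away from the project without changing the weight of the assignment or participation.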

Policies

Collaboration: The collaboration policy will be announced at the beginning of the semester.

External code: For the final project you may use external code within certain limits. If you are doing a research project, you may use code to the extent that your project still makes an ACL-level contribution. If you are re-implementing an existing paper, you may use external code similar to that used in the original paper. For all other assignments, you may not use external code without explicit permission. If you are not sure, please consult the instructor on the discussion board.

Late policy: Each student has a total of two "slip days" that may be used for late submissions without penalty for assignments and the final project. If working in groups, late days count for all members of the group. For example, to submit one day late, each group member must have at least one late day left.
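The group rule can be read as a simple check, sketched below. The function names, the dictionary of remaining days, and the student names are hypothetical. The sketch assumes each day of lateness uses one slip day from every group member, and it does not model what happens once slip days run out, since that is not specified here.

```python
def can_submit_late(days_late, slip_days_left):
    """Return True if every group member has enough slip days left.

    slip_days_left maps each (hypothetical) member name to that member's
    remaining slip days; each student starts the semester with two.
    """
    return all(left >= days_late for left in slip_days_left.values())


def use_slip_days(days_late, slip_days_left):
    """Deduct the late days from every member and return the new balances."""
    if not can_submit_late(days_late, slip_days_left):
        raise ValueError("at least one group member lacks enough slip days")
    return {member: left - days_late
            for member, left in slip_days_left.items()}


# Example: one member has already used one of their two slip days.
group = {"alice": 2, "bob": 1}
print(can_submit_late(1, group))  # every member has at least one day left
print(can_submit_late(2, group))  # bob has only one day left
print(use_slip_days(1, group))
```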

Prerequisites and Enrollment

Prerequisites: CS 2110 or equivalent programming experience, and a course in machine learning (CS 4780/CS 5780, CS 6780 or equivalent). The course is open to master's students with instructor permission. If you have any questions regarding enrollment, show up on the first day of class. Enrollment questions will be addressed then.