CS 775: Seminar in Natural Language Understanding

Fall 2004

Mondays, 4:30-5:30 p.m.

Upson 5160

Seminar plans: This semester, we will convene to present and discuss research papers from recent (usually this year's) NLP conferences as well as to discuss ongoing work in Cornell's NLP group. If you want to be added to the email announcement list please contact Claire Cardie: cardie@cs.cornell.edu.

Date	Paper
Sep 06	Initial meeting
Sep 13	Wilson, Theresa, Wiebe, Janyce, & Hwa, Rebecca (2004). Just how mad are you? Finding strong and weak opinion clauses. Proc. 19th National Conference on Artificial Intelligence (AAAI-2004).
Sep 20	Tadashi Nomoto, Multi-Engine Machine Translation with Voted Language Model, ACL 2004.
Sep 27	Taskar et al., Max-Margin Parsing, EMNLP 2004.
Oct 4	Prem Melville and Raymond J. Mooney. Diverse Ensembles for Active Learning Proceedings of the 21st International Conference on Machine Learning (ICML-2004), pp. 584-591, Banff, Canada, July 2004. Sugato Basu, Arindam Banerjee, and Raymond J. Mooney, Active Semi-Supervision for Pairwise Constrained Clustering. Proceedings of the SIAM International Conference on Data Mining (SDM-2004), pp. 333-344, Lake Buena Vista, FL, April 2004.
Oct 11	FALL BREAK
Oct 18	No class
Oct 25	Dan Roth and Wen-tau Yih. Linear Programming Formulation for Global Inference in Natural Language Tasks. CoNLL 2004.
Nov 1	Rie Kubota Ando. Semantic Lexicon Construction: Learning from Unlabeled Data via Spectral Analysis. CoNLL 2004.
Nov 8	Michael Collins and Scott Miller. Semantic Tagging using a Probabilistic Context Free Grammar. In Proceedings of the Sixth Workshop on Very Large Corpora 1998
Nov 15	Michael Collins and Nigel Duffy. Convolution Kernels for Natural Language. NIPS 2001.
Nov 22	Yang X, J Su, G Zhou and C L Tan. An NP-cluster based approach to coreference resolution, 20th International Conference on Computational Linguistics, COLING04, 23-27 Aug 2004

Papers suggested for discussion thus far...

Wesley Hildebrandt, Boris Katz and Jimmy Lin. Answering Definition Questions with Multiple Knowledge Sources. From HLT-NAACL 2004.

Scott Miller, Jethran Guinness and Alex Zamanian. Name Tagging with Word Clusters and Discriminative Training. From HLT-NAACL 2004.

Francine Chen, Ayman Farahat and Thorsten Brants. Multiple Similarity Measures and Source-Pair Information in Story Link Detection. From HLT-NAACL 2004.

Razvan Bunescu and Raymond J. Mooney. Collective Information Extraction with Relational Markov Networks. From ACL 2004.

Andrew Kehler, Douglas Appelt, Lara Taylor and Aleksandr Simma. The (Non)Utility of Predicate-Argument Frequencies for Pronoun Interpretation. From HLT-NAACL 2004.

Jonathan Graehl and Kevin Knight. Training Tree Transducers. From HLT-NAACL 2004.

Dan Roth and Wen-tau Yih. Linear Programming Formulation for Global Inference in Natural Language Tasks. From CoNLL 2004.

John Chen and Owen Rambow. Use of Deep Linguistic Features for the Recognition and Labeling of Semantic Arguments. EMNLP 2003.

Daniel Gildea and Julia Hockenmaier. Identifying Semantic Roles Using Combinatory Categorial Grammar. EMNLP 2003.

Rie Kubota Ando. Semantic Lexicon Construction: Learning from Unlabeled Data via Spectral Analysis. From CoNLL 2004.

R Florian, H Hassan, A Ittycheriah, H Jing, N Kambhatla, X Luo, N Nicolov and S Roukos. A Statistical Model for Multilingual Entity Detection and Tracking. From HLT-NAACL 2004.

Joyce Y. Chai and Rong Jin. Discourse Structure for Context Question Answering. From HLT-NAACL-QA workshop-2004.

Sharon Small, Tomek Strzalkowski, Ting Liu, Sean Ryan, Robert Salkin, Nobuyuki Shimizu, Paul Kantor, Diane Kelly, Robert Rittman, Nina Wacholder and Boris Yamrom. HITIQA: Scenario Based Question Answering. From HLT-NAACL-QA workshop-2004.

Stetina and Nagao 1997 WVLC, Corpus-based PP attachment ambiguity resolution with a semantic dictionary.

There is a running argument between language-modeling and probabilistic-modeling approaches to IR. To get an idea of what's behind the controversy, we could read the first part of the Lafferty paper http://la.lti.cs.cmu.edu/callan/Workshops/lmir01/WorkshopProcs/Papers/lafferty.pdf and the Zhai plus Sparck Jones and Robertson paper http://la.lti.cs.cmu.edu/callan/Workshops/lmir01/WorkshopProcs/Papers/ksj2.pdf.

Organizer: Claire Cardie

Previous runnings: F98, S00, S01 (as Statistical Natural Language Processing: Models and Methods), F01, S02, F02, S03, F03, S04.

See also the AI graduate studies page or the Cornell NLP page.

CS775, Fall '04

Back to CS course websites