CS 775: Seminar in Natural Language Understanding

Fall 2004

Mondays, 4:30-5:30 p.m.

Upson 5160

Seminar plans: This semester, we will convene to present and discuss research papers from recent (usually this year's) NLP conferences as well as to discuss ongoing work in Cornell's NLP group. If you want to be added to the email announcement list please contact Claire Cardie: cardie@cs.cornell.edu.

Date Paper
Sep 06 Initial meeting
Sep 13 Wilson, Theresa, Wiebe, Janyce, & Hwa, Rebecca (2004). Just how mad are you? Finding strong and weak opinion clauses. Proc. 19th National Conference on Artificial Intelligence (AAAI-2004).
Sep 20 Tadashi Nomoto, Multi-Engine Machine Translation with Voted Language Model,  ACL 2004.
Sep 27 Taskar et al., Max-Margin Parsing, EMNLP 2004.
Oct 4 Prem Melville and Raymond J. Mooney. Diverse Ensembles for Active Learning
Proceedings of the 21st International Conference on Machine Learning (ICML-2004), pp. 584-591, Banff, Canada, July 2004.

Sugato Basu, Arindam Banerjee, and Raymond J. Mooney, Active Semi-Supervision for Pairwise Constrained Clustering.
Proceedings of the SIAM International Conference on Data Mining (SDM-2004), pp. 333-344, Lake Buena Vista, FL, April 2004.
Oct 11

FALL BREAK

Oct 18 No class 
Oct 25 Dan Roth and Wen-tau Yih. Linear Programming Formulation for Global Inference in Natural Language Tasks. CoNLL 2004.
Nov 1 Rie Kubota Ando. Semantic Lexicon Construction: Learning from Unlabeled Data via Spectral Analysis. CoNLL 2004.
Nov 8 Michael Collins and Scott Miller. Semantic Tagging using a Probabilistic Context Free Grammar.
In Proceedings of the Sixth Workshop on Very Large Corpora
1998
Nov 15 Michael Collins and Nigel Duffy. Convolution Kernels for Natural Language. NIPS 2001.
Nov 22 Yang X, J Su, G Zhou and C L Tan. An NP-cluster based approach to coreference resolution, 20th
International Conference on Computational Linguistics, COLING04, 23-27 Aug 2004
Papers suggested for discussion thus far...

Wesley Hildebrandt, Boris Katz and Jimmy Lin. Answering Definition Questions with Multiple Knowledge Sources. From HLT-NAACL 2004.

Scott Miller, Jethran Guinness and Alex Zamanian. Name Tagging with Word Clusters and Discriminative Training. From HLT-NAACL 2004.

Francine Chen, Ayman Farahat and Thorsten Brants. Multiple Similarity Measures and Source-Pair Information in Story Link Detection. From HLT-NAACL 2004.

Razvan Bunescu and Raymond J. Mooney. Collective Information Extraction with Relational Markov Networks. From ACL 2004.

Andrew Kehler, Douglas Appelt, Lara Taylor and Aleksandr Simma. The (Non)Utility of Predicate-Argument Frequencies for Pronoun Interpretation. From HLT-NAACL 2004.

Jonathan Graehl and Kevin Knight. Training Tree Transducers. From HLT-NAACL 2004.

Dan Roth and Wen-tau Yih. Linear Programming Formulation for Global Inference in Natural Language Tasks. From CoNLL 2004.

John Chen and Owen Rambow. Use of Deep Linguistic Features for the Recognition and Labeling of Semantic Arguments. EMNLP 2003.

Daniel Gildea  and  Julia Hockenmaier. Identifying Semantic Roles Using Combinatory Categorial Grammar. EMNLP 2003.

Rie Kubota Ando. Semantic Lexicon Construction: Learning from Unlabeled Data via Spectral Analysis. From CoNLL 2004.

R Florian, H Hassan, A Ittycheriah, H Jing, N Kambhatla, X Luo, N Nicolov and S Roukos. A Statistical Model for Multilingual Entity Detection and Tracking. From HLT-NAACL 2004.

Joyce Y. Chai and Rong Jin. Discourse Structure for Context Question Answering. From HLT-NAACL-QA workshop-2004.

Sharon Small, Tomek Strzalkowski, Ting Liu, Sean Ryan, Robert Salkin, Nobuyuki Shimizu, Paul Kantor, Diane Kelly, Robert Rittman, Nina Wacholder and Boris Yamrom. HITIQA: Scenario Based Question Answering. From HLT-NAACL-QA workshop-2004.

Stetina and Nagao 1997 WVLC,  Corpus-based PP attachment ambiguity resolution with a semantic dictionary.

There is a running argument between language-modeling and probabilistic-modeling approaches to IR. To get an idea of what's behind the controversy, we could read the first part of the Lafferty paper http://la.lti.cs.cmu.edu/callan/Workshops/lmir01/WorkshopProcs/Papers/lafferty.pdf and the Zhai plus Sparck Jones and Robertson paper http://la.lti.cs.cmu.edu/callan/Workshops/lmir01/WorkshopProcs/Papers/ksj2.pdf. 


Organizer: Claire Cardie

Previous runnings: F98, S00, S01 (as Statistical Natural Language Processing: Models and Methods), F01, S02, F02, S03, F03, S04.

See also the AI graduate studies page or the Cornell NLP page.



CS775, Fall '04

Back to CS course websites