EMNLP 2001 Program

EMNLP 2001 Conference Program

Paper titles link to on-line proceedings.

SUNDAY, JUNE 3

8:45 9:00 Welcome

Session I: Learning
Session Chair: Paola Velardi

9:00 9:25 Limitations of Co-Training for Natural Language Learning from Large Datasets

David Pierce and Claire Cardie

9:25 9:50 A Sequential Model for Multi-Class Classification

Yair Even-Zohar and Dan Roth

9:50 10:15 Learning Within-Sentence Semantic Coherence

Elena Eneva, Rose Hoberman and Lucian Lita

10:15 10:45 Break

Session II: Machine Translation
Session Chair: Kenneth Church

10:45 11:10 Knowledge Sources for Word-Level Translation Models

Philipp Köhn and Kevin Knight

11:10 11:35 Improving Lexical Mapping Model of English-Korean Bitext Using Structural Features

Seonho Kim, Juntae Yoon and Mansuk Song

11:35 11:45 Short break

11:45 12:45 Invited talk, Eric Brill
"Paucity Shmaucity -- What Can We Do With A Trillion Words?"

12:45 2:00 LUNCH

Session III: Text Categorization
Session Chair: Wessel Kraaij

2:00 2:25 Stacking Classifiers for Anti-Spam Filtering of E-Mail

Georgios Sakkis, Ion Androutsopoulos, Georgios Paliouras, Vangelis Karkaletsis, Constantine D. Spyropoulos and Panagiotis Stamatopoulos

2:25 2:50 Feature Space Restructuring for SVMs with Application to Text Categorization

Hiroya Takamura and Yuji Matsumoto

2:50 3:15 Using Bins to Empirically Estimate Term Weights for Text Categorization

Carl Sable and Kenneth W. Church

3:15 3:25 Short break

Session IV: Question Answering and Information Extraction
Session Chair: Roman Yangarber

3:25 3:50 Question Answering Using a Large Text Database: A Machine Learning Approach

Hwee Tou Ng, Jennifer Lai Pheng Kwan and Yiyuan Xia

3:50 4:15 Information Extraction Using the Structured Language Model

Ciprian Chelba and Milind Mahajan

4:15 4:30 Short break

4:30 5:30 Panel: When does EM work? (includes an introduction to the Expectation-Maximization algorithm)
Eugene Charniak, Kevin Knight, Ted Pedersen, Stefan Riezler

	SUNDAY, JUNE 3
8:45	9:00	Welcome
	Session I: Learning Session Chair: Paola Velardi
9:00	9:25	Limitations of Co-Training for Natural Language Learning from Large Datasets
David Pierce and Claire Cardie
9:25	9:50	A Sequential Model for Multi-Class Classification
Yair Even-Zohar and Dan Roth
9:50	10:15	Learning Within-Sentence Semantic Coherence
Elena Eneva, Rose Hoberman and Lucian Lita
10:15	10:45	Break
	Session II: Machine Translation Session Chair: Kenneth Church
10:45	11:10	Knowledge Sources for Word-Level Translation Models
Philipp Köhn and Kevin Knight
11:10	11:35	Improving Lexical Mapping Model of English-Korean Bitext Using Structural Features
Seonho Kim, Juntae Yoon and Mansuk Song
11:35	11:45	Short break
11:45	12:45	Invited talk, Eric Brill "Paucity Shmaucity -- What Can We Do With A Trillion Words?"
12:45	2:00	LUNCH
	Session III: Text Categorization Session Chair: Wessel Kraaij
2:00	2:25	Stacking Classifiers for Anti-Spam Filtering of E-Mail
Georgios Sakkis, Ion Androutsopoulos, Georgios Paliouras, Vangelis Karkaletsis, Constantine D. Spyropoulos and Panagiotis Stamatopoulos
2:25	2:50	Feature Space Restructuring for SVMs with Application to Text Categorization
Hiroya Takamura and Yuji Matsumoto
2:50	3:15	Using Bins to Empirically Estimate Term Weights for Text Categorization
Carl Sable and Kenneth W. Church
3:15	3:25	Short break
	Session IV: Question Answering and Information Extraction Session Chair: Roman Yangarber
3:25	3:50	Question Answering Using a Large Text Database: A Machine Learning Approach
Hwee Tou Ng, Jennifer Lai Pheng Kwan and Yiyuan Xia
3:50	4:15	Information Extraction Using the Structured Language Model
Ciprian Chelba and Milind Mahajan
4:15	4:30	Short break
4:30	5:30	Panel: When does EM work? (includes an introduction to the Expectation-Maximization algorithm) Eugene Charniak, Kevin Knight, Ted Pedersen, Stefan Riezler

		MONDAY, JUNE 4
		Session V: Lexical Acquisition and Text Segmentation Session Chair: David Yarowsky
8:35	9:00	Classifying the Semantic Relations in Noun Compounds via a Domain-Specific Lexical Hierarchy
8:35	9:00	Barbara Rosario and Marti Hearst
9:00	9:25	The Unknown Word Problem: a Morphological Analysis of Japanese Using Maximum Entropy Aided by a Dictionary
9:00	9:25	Kiyotaka Uchimoto, Satoshi Sekine and Hitoshi Isahara
9:25	9:50	Is Knowledge-Free Induction of Multiword Unit Dictionary Headwords a Solved Problem?
9:25	9:50	Patrick Schone and Daniel Jurafsky
9:50	10:15	Latent Semantic Analysis for Text Segmentation
9:50	10:15	Freddy Y. Y. Choi, Peter Wiemer-Hastings and Johanna Moore
10:15	10:45	Break
		Session VI: Applications Session Chair: Marti Hearst
10:45	11:10	Detecting Short Passages of Similar Text in Large Document Collections
10:45	11:10	Caroline Lyon, James Malcolm and Bob Dickerson
11:10	11:35	Hybrid Text Mining for Finding Abbreviations and their Definitions
11:10	11:35	Youngja Park and Roy J. Byrd
11:35	11:45	Short break
11:45	12:45	Panel: What Works and What Doesn't? Industrial Perspectives Adam Berger, David Evans, Joshua Goodman, Lynette Hirschman
12:45	2:00	LUNCH
		Session VII: Spoken Language Output Session Chair: Roni Rosenfeld
2:00	2:25	Automatic Corpus-based Tone Prediction using K-ToBI Representation
2:00	2:25	Jin-Seok Lee, Byeongchang Kim and Gary Geunbae Lee
2:25	2:50	Probabilistic Context-Free Grammars for Syllabification and Grapheme-to-Phoneme Conversion
2:25	2:50	Karin Müller
2:50	3:00	Short break
		Session VIII: POS Tagging and Corpus Analysis Session Chair: Thorsten Brants
3:00	3:25	Comparing Data-Driven Learning Algorithms for PoS Tagging of Swedish
3:00	3:25	Beáta Megyesi
3:25	3:50	Impact of Quality and Quantity of Corpora on Stochastic Generation
3:25	3:50	Srinivas Bangalore, John Chen and Owen Rambow
3:50	4:15	Corpus Variation and Parser Performance
3:50	4:15	Daniel Gildea
4:15	4:30	Refreshments (close)

Note: the formatting of this page comes from the version generated for the NAACL CDROM.