Information Extraction using the Structured Language Model 
Ciprian Chelba and Milind Mahajan 
 
Knowledge Sources for Word-Level Translation Models 
Philipp Koehn and Kevin Knight 
 
Limitations of Co-training for Natural Language Learning from Large Datasets 
David Pierce and Claire Cardie 
 
Question Answering Using a Large Text Database: A Machine Learning Approach 
Hwee Tou Ng, Jennifer Lai Pheng Kwan, and Yiyuan Xia 
 
Stacking classifiers for anti-spam filtering of e-mail 
Georgios Sakkis, Ion Androutsopoulos, Georgios Paliouras, Vangelis Karkaletsis, Constantine D. Spyropoulos, and Panagiotis Stamatopoulos 
 
A Sequential Model for Multi-class Classification 
Yair Even-Zohar and Dan Roth
 
Feature Space Restructuring for SVMs with Application to Text Categorization 
Hiroya Takamura and Yuji Matsumoto 
 
Comparing Data-driven Learning Algorithms for PoS Tagging of Swedish 
Beata Megyesi 
 
Classifying Semantic Relations between Noun Compounds using a Domain-Specific Lexical Hierarchy 
Barbara Rosario and Marti Hearst 
 
Automatic Corpus-based Tone Prediction using K-ToBI Representation 
Jin-seok Lee, Byeongchang Kim and Gary Geunbae Lee 
 
Using Bins to Empirically Estimate Term Weights for Text Categorization 
Carl Sable and Ken Church 
 
Using Shallow NLP in Adaptive Information Extraction from Web-related Texts 
Fabio Ciravegna 
 
Detecting short passages of similar text in large document collections 
Caroline Lyon, Bob Dickerson and James Malcolm 
 
Impact of quality and quantity of corpora on stochastic generation 
Srinivas Bangalore, John Chen, and Owen Rambow 
 
Improving Lexical Mapping Model of English-Korean Bitext Using Structural Features 
Seonho Kim, Juntae Yoon and Mansuk Song 
 
Corpus Variation and Parser Performance 
Daniel Gildea 
 
The Unknown Word Problem: A Morphological Analysis of Japanese Using Maximum Entropy Aided by a Dictionary 
Kiyotaka Uchimoto, Satoshi Sekine, and Hitoshi Isahara
 
Latent Semantic Analysis for Text Segmentation 
Freddy Y. Y. Choi, Peter Wiemer-Hastings, and Johanna Moore 
 
Hybrid text mining for finding abbreviations and their definitions 
Youngja Park and Roy J. Byrd 
 
Is Knowledge-Free Induction of Multiword Unit Dictionary Headwords a Solved Problem? 
Patrick Schone and Daniel Jurafsky 
 
Learning Within-Sentence Semantic Coherence 
Elena Eneva, Rose Hoberman, and Lucian Lita 
 
Probabilistic Context-Free Grammars for Syllabification and Grapheme-to-Phoneme Conversion 
Karin Mueller