| SUNDAY, JUNE 3 | ||||
|---|---|---|---|---|
| 8:45 | 9:00 | Welcome | ||
| Session
I: Learning
 Session Chair: Paola Velardi  | 
||||
| 9:00 | 9:25 | Limitations of Co-Training for Natural Language Learning from Large Datasets | ||
| 9:25 | 9:50 | A Sequential Model for Multi-Class Classification | ||
| 9:50 | 10:15 | Learning Within-Sentence Semantic Coherence | ||
| 10:15 | 10:45 | Break | ||
| Session
II: Machine Translation
 Session Chair: Kenneth Church  | 
||||
| 10:45 | 11:10 | Knowledge Sources for Word-Level Translation Models | ||
| 11:10 | 11:35 | Improving Lexical Mapping Model of English-Korean Bitext Using Structural Features | ||
| 11:35 | 11:45 | Short break | ||
| 11:45 | 12:45 | Invited
talk, Eric Brill
 "Paucity Shmaucity -- What Can We Do With A Trillion Words?"  | 
||
| 12:45 | 2:00 | LUNCH | ||
| Session
III: Text Categorization
 Session Chair: Wessel Kraaij  | 
||||
| 2:00 | 2:25 | Stacking Classifiers for Anti-Spam Filtering of E-Mail | ||
| 2:25 | 2:50 | Feature Space Restructuring for SVMs with Application to Text Categorization | ||
| 2:50 | 3:15 | Using Bins to Empirically Estimate Term Weights for Text Categorization | ||
| 3:15 | 3:25 | Short break | ||
| Session
IV: Question Answering and Information Extraction
 Session Chair: Roman Yangarber  | 
||||
| 3:25 | 3:50 | Question Answering Using a Large Text Database: A Machine Learning Approach | ||
| 3:50 | 4:15 | Information Extraction Using the Structured Language Model | ||
| 4:15 | 4:30 | Short break | ||
| 4:30 | 5:30 | Panel:
When does EM work? (includes an introduction to the Expectation-Maximization
algorithm)
 Eugene Charniak, Kevin Knight, Ted Pedersen, Stefan Riezler  | 
||
| MONDAY, JUNE 4 | ||||
|---|---|---|---|---|
| Session
V: Lexical Acquisition and Text Segmentation
 Session Chair: David Yarowsky  | 
||||
| 8:35 | 9:00 | Classifying the Semantic Relations in Noun Compounds via a Domain-Specific Lexical Hierarchy | ||
| 9:00 | 9:25 | The Unknown Word Problem: a Morphological Analysis of Japanese Using Maximum Entropy Aided by a Dictionary | ||
| 9:25 | 9:50 | Is Knowledge-Free Induction of Multiword Unit Dictionary Headwords a Solved Problem? | ||
| 9:50 | 10:15 | Latent Semantic Analysis for Text Segmentation | ||
| 10:15 | 10:45 | Break | ||
| Session
VI: Applications
 Session Chair: Marti Hearst  | 
||||
| 10:45 | 11:10 | Detecting Short Passages of Similar Text in Large Document Collections | ||
| 11:10 | 11:35 | Hybrid Text Mining for Finding Abbreviations and their Definitions | ||
| 11:35 | 11:45 | Short break | ||
| 11:45 | 12:45 | Panel:
What Works and What Doesn't? Industrial Perspectives 
 Adam Berger, David Evans, Joshua Goodman, Lynette Hirschman  | 
||
| 12:45 | 2:00 | LUNCH | ||
| Session
VII: Spoken Language Output
 Session Chair: Roni Rosenfeld  | 
||||
| 2:00 | 2:25 | Automatic Corpus-based Tone Prediction using K-ToBI Representation | ||
| 2:25 | 2:50 | Probabilistic Context-Free Grammars for Syllabification and Grapheme-to-Phoneme Conversion | ||
| 2:50 | 3:00 | Short break | ||
| Session
VIII: POS Tagging and Corpus Analysis
 Session Chair: Thorsten Brants  | 
||||
| 3:00 | 3:25 | Comparing Data-Driven Learning Algorithms for PoS Tagging of Swedish | ||
| 3:25 | 3:50 | Impact of Quality and Quantity of Corpora on Stochastic Generation | ||
| 3:50 | 4:15 | Corpus Variation and Parser Performance | ||
| 4:15 | 4:30 | Refreshments (close) | ||
Note: the formatting of this page comes from the version generated for the NAACL CDROM.