Computer Science Colloquium Series, Spring 2005

Gerard Salton Lecture Series	Tuesday, March 1, 2005 4:15 pm B17 Upson Hall
Karen Spärck-Jones Cambridge University
Natural Language Information Processing: Words and Tasks, Statistics and Models
I will review the development of some key ideas in natural language information processing (NLIP), showing how statistical approaches were established for text retrieval and have spread to many other tasks like summarising. Some simple statistical techniques have proved very effective, and are now widely applied in operational systems as well as research. These techniques should be motivated by underlying models for the NLIP tasks. I will consider how different models apply to a single task, retrieval, and how a single model applies to very different tasks, illustrating the impact of the single-model approach on current research.

Karen Spärck-Jones Cambridge University