Distributional similarity models: Clustering vs. nearest neighbors.
Lillian Lee and Fernando Pereira.
37th Annual Meeting of the ACL, pp. 33--40, 1999

Abstract: Distributional similarity is a useful notion in estimating the probabilities of rare joint events. It has been employed both to cluster events according to their distributions, and to directly compute averages of estimates for distributional neighbors of a target event. Here, we examine the tradeoffs between model size and prediction accuracy for cluster-based and nearest neighbors distributional models of unseen events.

Paper formats: ps, pdf

BibTeX entry:

@InProceedings{Lee+Pereira:99a,
  author = 	 {Lillian Lee and Fernando Pereira},
  title = 	 {Distributional similarity models: Clustering vs. nearest neighbors},
  booktitle = 	 "37th Annual Meeting of the Association for Computational Linguistics",
  pages =	 {33--40},
  year =	 1999
}


Back links: Lillian Lee's home page or papers page; Cornell NLP page.