Menu:

The HIMALAYA Data Mining Project

The HIMALAYA Data Mining Project at Cornell researched innovative techniques for analyzing large datasets. Results from this project are available on this page.

Source Code

We made the source code of our algorithms available as part of the Himalaya data mining tools source code distribution on sourceforge. Code is available for the following algorithms: MAFIA (Mining Maximal Frequent Itemsets), SECRET (Scalable Linear Regression Trees), and (SPAM: Sequential Pattern Mining).

Past Research Topics

Publications

2004

2003

2002

2001

2000

1999

People

Researchers

Collaborators:

Alumni

Acknowledgements:

The HIMALAYA Data Mining Project is supported in part by NSF grants IIS-0121175 and IIS-0084762, the KD-D Initiative, by the Cornell Intelligent Information Systems Institute , the Cornell Information Assurance Institute , and by generous gifts from Microsoft and Intel. Any opinions, findings, conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the sponsors.