Refereed conference and journal papers
|
| 2008
|
Daria Sorokina, Rich Caruana, Mirek Riedewald, Daniel Fink.
Detecting Statistical Interactions with Additive Groves of Trees. To appear in proceedings of the 25th International Conference on Machine Learning (ICML'08).(pdf).
|
| 2007
|
Daria Sorokina, Rich Caruana, Mirek Riedewald.
Additive Groves of Regression Trees. In proceedings of the 18th European Conference on Machine Learning (ECML'07). (Best Student Paper award.) (pdf).
|
| 2007
|
W. Hochachka, R. Caruana, A. Munson, M. Riedewald, D. Sorokina, D. Fink, S. Kelling.
Data-Mining Discovery of Pattern and Process in Ecological Systems. Journal of Wildlife Management: 71(7), pp. 2427-2437. (pdf).
|
| 2006
|
Daria Sorokina, Johannes Gehrke, Simeon Warner, Paul Ginsparg.
Plagiarism Detection in arXiv. In proceedings of the 6th IEEE International Conference on Data Mining (ICDM'06).
Short version: 6 pages (pdf) .
Full version: 13 pages (pdf).
|
| 2006
|
R. Caruana, M. Elhawary, A. Munson, M. Riedewald, D. Sorokina, D. Fink, W. Hochachka, S. Kelling.
Mining Citizen Science Data to Predict Prevalence of Wild Bird Species. In proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'06).
(pdf).
|
Tech reports
|
| 2006
|
Daria Sorokina, Johannes Gehrke, Simeon Warner, Paul Ginsparg.
Plagiarism Detection in arXiv. Technical Report TR2006-2046, Computing and Information Science, Cornell University, 2006. (full version of ICDM'06 paper pdf).
|
| 2003
|
Daria Sorokina, Mikhail Petrovskiy.
Adaptation of the Fuzzy Decision Tree Algorithm for Multidimensional Datacubes. Collected Articles on Software Systems and Tools, CMC MSU publishing, Moscow, Russia.
|
| 2003
|
Daria Erofeyeva.
Fuzzy Approach to Classification for Multidimensional Datacubes. Diplom thesis, Moscow State University.
|
|