vchoice-Yc-Items dataset
This is a variable subset choice dataset, so it consists of a collection of subset selections from varying slates of alternatives. In this dataset, the variable choice set is all items clicked on by a user in a given browsing session on an e-commerce web site. The subset selection is the set of items purchased in that session, which is a subset of those that were clicked on. This dataset was derived from data for the 2015 RecSys challenge (provided by YOOCHOOSE). Some basic statistics of this dataset are:
  • number of items: 2,975
  • number of subset selections: 156,039
Data files: If you use this data, please cite the following paper:
  • A Discrete Choice Model for Subset Selection.
    Austin R. Benson, Ravi Kumar, and Andrew Tomkins.
    In Proceedings of the 11th ACM International Conference on Web Search and Data Mining (WSDM), 2018. [bibtex]