walmart-trips dataset
The walmart-trips network is a hypergraph where hyperedges are sets of co-purchased products at Walmart, as released as part of a Kaggle competition. We assigned products to one of ten broad departments in which the product appears on walmart.com (e.g., "Clothing, Shoes, and Accessories"), and these serve as node labels (there is also an additional "Other" class). Some summary statistics of the dataset are:
  • number of nodes: 88,860
  • number of hyperedges: 69,906
  • mean / median hyperedge size: 6.6 / 5
  • rank of hypergraph (maximum hyperedge size): 25
  • number of node classes: 11
Data files: If you use this data, please cite the following paper:
  • Clustering in graphs and hypergraphs with categorical edge labels.
    Ilya Amburg, Nate Veldt, and Austin R. Benson.
    Proceedings of the Web Conference (WWW), 2020. [bibtex]