mathoverflow-answers dataset
The mathoverflow-answers network is a hypergraph where hyperedges are sets of questions answered by users on Math Overflow. Nodes are labeled by the tags used in the questions, and nodes often have multiple labels. Some summary statistics of the dataset are:
  • number of nodes: 73,851
  • number of hyperedges: 5,446
  • mean / median hyperedge size: 24.2 / 5
  • rank of hypergraph (maximum hyperedge size): 1,784
  • number of node classes: 1,456
Data files: If you use this data, please cite the following paper:
  • Minimizing Localized Ratio Cut Objectives in Hypergraphs.
    Nate Veldt, Austin R. Benson, and Jon Kleinberg.
    Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), 2020. [bibtex]