sos-coauth-Business dataset
This dataset is a collection of sequences of sets. Each sequence is the time-ordered list of coauthor sets from one researcher's publications. Publication data comes from the Microsoft Academic Graph, restricted to papers labeled with the "Business" subject. Every sequence contains at least 10 sets, and only sets of size at most 5 are included. Basic statistics of the dataset:
  • number of sequences: 24,019
  • number of unique elements appearing in sets: 236,226
  • number of sets: 463,070
  • number of unique sets: 271,294
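The construction rules and the four statistics above can be illustrated with a minimal Python sketch. It assumes the sequences are already loaded in memory as lists of frozensets of integer author IDs; the on-disk file format is not described here, so no loader is shown. The function names filter_sequences and summarize are hypothetical, and the order of filtering (drop oversized sets first, then drop short sequences) is an assumption rather than the authors' documented procedure.

```python
from typing import Dict, FrozenSet, List

# Hypothetical in-memory representation: one sequence per researcher,
# each sequence a time-ordered list of coauthor sets (author IDs as ints).
Sequence = List[FrozenSet[int]]


def filter_sequences(
    sequences: List[Sequence], max_set_size: int = 5, min_sets: int = 10
) -> List[Sequence]:
    """Keep sets with at most max_set_size elements, then keep sequences
    that still have at least min_sets sets (assumed order of operations)."""
    kept = []
    for seq in sequences:
        small = [s for s in seq if len(s) <= max_set_size]
        if len(small) >= min_sets:
            kept.append(small)
    return kept


def summarize(sequences: List[Sequence]) -> Dict[str, int]:
    """Compute the four statistics listed in the dataset description."""
    all_sets = [s for seq in sequences for s in seq]
    elements = set()
    for s in all_sets:
        elements.update(s)
    return {
        "sequences": len(sequences),
        "unique_elements": len(elements),
        "sets": len(all_sets),
        "unique_sets": len(set(all_sets)),  # frozensets are hashable
    }


# Toy data with a lowered min_sets threshold so the example stays small.
toy = [
    [frozenset({1, 2}), frozenset({1, 2}), frozenset({3})],
    [frozenset({4}), frozenset({1, 2, 3, 4, 5, 6})],
]
stats = summarize(filter_sequences(toy, max_set_size=5, min_sets=2))
print(stats)
```

In the toy data, the second sequence loses its six-element set and then falls below the minimum length, so only the first sequence is counted. A repeated set such as {1, 2} contributes twice to the number of sets but once to the number of unique sets, which is why those two statistics differ in the dataset.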
If you use this data, please cite the following papers:
  • Sequences of sets.
    Austin R. Benson, Ravi Kumar, and Andrew Tomkins.
    Proceedings of KDD, 2018.
  • Simplicial closure and higher-order link prediction.
    Austin R. Benson, Rediet Abebe, Michael T. Schaub, Ali Jadbabaie, and Jon Kleinberg.
    Proceedings of the National Academy of Sciences (PNAS), 2018.
  • An overview of Microsoft Academic Service (MAS) and applications.
    Arnab Sinha, Zhihong Shen, Yang Song, Hao Ma, Darrin Eide, Bo-June Hsu, and Kuansan Wang.
    Proceedings of WWW, 2015.