sos-contact-high-school dataset
This dataset is a collection of sequences of sets. The sets are constructed from interactions recorded by wearable sensors in a high school. The sensors record proximity-based contacts every 20 seconds. There is one sequence of sets per person, and we consider the set of individuals that a person comes into contact within each 20 second interval to be a set (only nonempty sets are considered; some intervals contain no interactions). All sequences contain at least 10 sets, and only sets of size at most 5 are considered. Some basic statistics of this dataset are:
  • number of sequences: 325
  • number of unique elements appearing in sets: 327
  • number of sets: 308,990
  • number of unique sets: 9,785
Data: If you use this data, please cite the following papers:
  • Sequences of sets.
    Austin R. Benson, Ravi Kumar, and Andrew Tomkins.
    Proceedings of KDD, 2018. [bibtex]
  • Contact Patterns in a High School: A Comparison between Data Collected Using Wearable Sensors, Contact Diaries and Friendship Surveys.
    Rossana Mastrandrea, Julie Fournet, and Alain Barrat.
    PLOS ONE, 2015. [bibtex]