congress-bills dataset
This is a temporal higher-order network dataset, which here means a
sequence of timestamped simplices where each simplex is a set of
nodes. In this dataset, nodes are US Congresspersons and simplices are
comprised of the sponsor and co-sponsors of legislative bills put
forth in both the House of Representatives and the Senate.
Timestamps are in days and represent when the bill was introduced.
The dataset was derived
from James Fowler's data.
The projected graph is a weighted undirected graph representing how
many times each pair of nodes co-appears in a simplex. We restricted
to simplices that consist of at most 25 nodes. Some basic statistics
of this dataset are:
- number of nodes: 1,718
- number of timestamped simplices: 260,851
- number of unique simplices: 85,082
- number of edges in projected graph: 424,932
- congress-bills.tar.gz (timestamped simplices and node labels)
- congress-bills-proj-graph.tar.gz (weighted projected graph)
- congress-bills-full.tar.gz (timestamped simplices and node labels)
- congress-bills-full-proj-graph.tar.gz (weighted projected graph)
- Simplicial closure and higher-order link prediction.
Austin R. Benson, Rediet Abebe, Michael T. Schaub, Ali Jadbabaie, and Jon Kleinberg.
Proceedings of the National Academy of Sciences (PNAS), 2018. [bibtex] -
Connecting the Congress: A Study of Cosponsorship Networks.
James H. Fowler.
Political Analysis, 2006. [bibtex] -
Legislative Cosponsorship Networks in the U.S. House and Senate.
James H. Fowler.
Social Networks, 2006. [bibtex]