coauth-DBLP

Description

Nodes represent authors, and each hyperedge represents a publication recorded on DBLP and contains the set of that publication's co-authors. This is a temporal higher-order network dataset: each hyperedge is timestamped with its year of publication.
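As a rough illustration of this structure, the sketch below stores each publication as a set of author IDs plus a year. The `Publication` class, the integer author IDs, and the toy records are all hypothetical; the dataset's actual file layout is not described here and may differ.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Publication:
    """One hyperedge: the co-authors of a publication and its year."""
    authors: frozenset  # node IDs of the co-authors (hypothetical integer IDs)
    year: int           # timestamp: year of publication


# Hypothetical toy records, NOT taken from the real dataset.
records = [
    ([1, 2, 3], 2001),
    ([2, 3], 2003),
    ([1, 2, 3], 2005),  # same author set as the first record, later year
    ([4], 2005),        # single-author publication
]

hyperedges = [Publication(frozenset(authors), year) for authors, year in records]

# The node set is the union of all author sets.
nodes = set().union(*(h.authors for h in hyperedges))

# Temporal ordering: sort hyperedges by their timestamp.
by_year = sorted(hyperedges, key=lambda h: h.year)
```

Using a `frozenset` makes each author set hashable and order-independent, so two publications by the same group of authors compare equal on their `authors` field even when their years differ.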

Basic statistics

  • Nodes: 1930378
  • Hyperedges: 3700681
  • Unique hyperedges: 2467396
  • Maximum hyperedge size: 280
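The following sketch shows one way such statistics could be computed from a list of hyperedges. It assumes that "unique hyperedges" means distinct author sets with timestamps ignored; that reading is an assumption, not something the source states. The function name `basic_statistics` and the toy input are hypothetical.

```python
def basic_statistics(hyperedges):
    """Summarize a hypergraph given as an iterable of node-ID collections."""
    edge_sets = [frozenset(e) for e in hyperedges]
    nodes = set().union(*edge_sets) if edge_sets else set()
    return {
        "nodes": len(nodes),
        "hyperedges": len(edge_sets),
        # Assumption: duplicates are hyperedges with identical author sets.
        "unique_hyperedges": len(set(edge_sets)),
        "max_hyperedge_size": max((len(e) for e in edge_sets), default=0),
    }


# Hypothetical toy input, NOT taken from the real dataset.
toy = [[1, 2, 3], [2, 3], [1, 2, 3], [4]]
stats = basic_statistics(toy)
```

On the toy input, the repeated author set `{1, 2, 3}` is counted twice under "hyperedges" but once under "unique_hyperedges", which is one plausible explanation for why the two figures above differ.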

Hyperedge size distribution
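A hyperedge size distribution counts how many hyperedges contain each number of nodes. The sketch below is a minimal way to tabulate it; the function name and toy input are hypothetical, and repeated hyperedges are assumed to be counted once per occurrence.

```python
from collections import Counter


def hyperedge_size_distribution(hyperedges):
    """Map each hyperedge size to the number of hyperedges of that size."""
    # set(e) guards against a node being listed twice in one hyperedge.
    return Counter(len(set(e)) for e in hyperedges)


# Hypothetical toy input, NOT taken from the real dataset.
toy = [[1, 2, 3], [2, 3], [1, 2, 3], [4]]
size_counts = hyperedge_size_distribution(toy)
```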

Hyperdegree distribution
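The hyperdegree of a node is the number of hyperedges it belongs to, which here corresponds to the number of publications an author appears on. The sketch below tabulates hyperdegrees and their distribution; the function name and toy input are hypothetical, and repeated hyperedges are assumed to contribute once per occurrence.

```python
from collections import Counter


def hyperdegree_distribution(hyperedges):
    """Return (hyperdegree per node, number of nodes per hyperdegree)."""
    degree = Counter()
    for e in hyperedges:
        # Count each node at most once per hyperedge.
        degree.update(set(e))
    return degree, Counter(degree.values())


# Hypothetical toy input, NOT taken from the real dataset.
toy = [[1, 2, 3], [2, 3], [1, 2, 3], [4]]
degree, degree_counts = hyperdegree_distribution(toy)
```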

Related datasets

Provenance

Source: https://www.cs.cornell.edu/~arb/data/coauth-DBLP/

License: Not specified. Please refer to the original source for licensing terms.

Reproducibility: Instructions and scripts

Citation

When this data is used in published research or for visualization purposes, please cite the following:

                    
                    @article{benson2018simplicial,
 author = {Benson, Austin R. and Abebe, Rediet and Schaub, Michael T. and Jadbabaie, Ali and Kleinberg, Jon},
 title = {Simplicial closure and higher-order link prediction},
 year = {2018},
 doi = {10.1073/pnas.1800683115},
 publisher = {National Academy of Sciences},
 issn = {0027-8424},
 journal = {Proceedings of the National Academy of Sciences}
}