coauth-cs-ICML

Description

This dataset is a subset of the Microsoft Academic Graph in which nodes represent authors and each hyperedge is the set of authors of a publication at ICML (a top-tier computer science conference). Papers with more than 25 authors were omitted. The dataset is derived from cat-edge-MAG-10, so some publications may be missing: when the same set of authors published at multiple conferences in that source, only the most frequent venue was kept as the hyperedge's interaction category, and ties were discarded.
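The venue rule described above can be illustrated with a minimal sketch. This is not the original preprocessing script: the function names `assign_venues` and `venue_subset`, the `(authors, venue)` input format, and the choice to apply the 25-author cutoff after labeling are all assumptions made for illustration.

```python
from collections import Counter, defaultdict


def assign_venues(papers):
    """Map each distinct author set to its most frequent venue.

    `papers` is an iterable of (authors, venue) pairs (hypothetical format).
    Author sets whose top venue is tied are discarded.
    """
    venue_counts = defaultdict(Counter)
    for authors, venue in papers:
        venue_counts[frozenset(authors)][venue] += 1

    labeled = {}
    for author_set, counts in venue_counts.items():
        ranked = counts.most_common(2)
        # A tie for first place means no single most frequent venue: skip it.
        if len(ranked) > 1 and ranked[0][1] == ranked[1][1]:
            continue
        labeled[author_set] = ranked[0][0]
    return labeled


def venue_subset(labeled, venue, max_size=25):
    """Keep hyperedges labeled with `venue` that have at most `max_size` authors."""
    return [set(a) for a, v in labeled.items() if v == venue and len(a) <= max_size]


# Toy input (made up): {a, b} appears twice at ICML and once at KDD,
# {c, d} is tied between ICML and KDD, {e} appears only at KDD.
papers = [
    (["a", "b"], "ICML"),
    (["b", "a"], "ICML"),
    (["a", "b"], "KDD"),
    (["c", "d"], "ICML"),
    (["c", "d"], "KDD"),
    (["e"], "KDD"),
]
labeled = assign_venues(papers)
icml_edges = venue_subset(labeled, "ICML")
```

In this toy input the author set {c, d} is dropped because of the tie, which is how a genuine ICML publication can end up missing from the dataset.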

Basic statistics

  • Nodes: 9981
  • Hyperedges: 4803
  • Unique hyperedges: 4803
  • Max size hyperedge: 23
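As a rough sketch, the statistics above (and the two distributions named in the following sections) could be recomputed from a list of hyperedges along these lines. The in-memory representation, a list of author-ID collections, and the helper names are assumptions; the dataset's actual file format is not described here.

```python
from collections import Counter


def hypergraph_stats(hyperedges):
    """Return node count, hyperedge count, unique hyperedge count, and max size."""
    edges = [frozenset(e) for e in hyperedges]
    nodes = set().union(*edges) if edges else set()
    return {
        "nodes": len(nodes),
        "hyperedges": len(edges),
        "unique_hyperedges": len(set(edges)),
        "max_size": max((len(e) for e in edges), default=0),
    }


def size_distribution(hyperedges):
    """Count how many hyperedges have each size."""
    return Counter(len(set(e)) for e in hyperedges)


def hyperdegree_distribution(hyperedges):
    """Count how many nodes belong to each number of hyperedges."""
    degree = Counter(node for e in hyperedges for node in set(e))
    return Counter(degree.values())


# Toy hypergraph (made up): four hyperedges, two of them identical as sets.
toy = [[1, 2, 3], [2, 3], [3, 2], [4]]
stats = hypergraph_stats(toy)
sizes = size_distribution(toy)
degrees = hyperdegree_distribution(toy)
```

Because each author set carries a single venue label in the source, the hyperedge count and the unique hyperedge count are expected to coincide for this dataset, as they do in the figures above.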

Hyperedge size distribution

Hyperdegree distribution

Related datasets

Provenance

Source: https://www.cs.cornell.edu/~arb/data/cat-edge-MAG-10/

License: Not specified. Please refer to the original source for licensing terms.

Reproducibility: Instructions and scripts

Citation

When this data is used in published research or for visualization purposes, please cite the following:

                    
@inproceedings{amburg2020clustering,
  title   = {Clustering in graphs and hypergraphs with categorical edge labels},
  author  = {Amburg, Ilya and Veldt, Nate and Benson, Austin R.},
  booktitle = {Proceedings of the Web Conference},
  year    = {2020}
}

@inproceedings{sinha2015mag,
  doi = {10.1145/2740908.2742839},
  url = {https://doi.org/10.1145/2740908.2742839},
  year  = {2015},
  publisher = {{ACM} Press},
  author = {Arnab Sinha and Zhihong Shen and Yang Song and Hao Ma and Darrin Eide and Bo-June (Paul) Hsu and Kuansan Wang},
  title = {An Overview of Microsoft Academic Service ({MAS}) and Applications},
  booktitle = {Proceedings of the 24th International Conference on World Wide Web}
}