tags-stack-overflow
Description
Nodes represent tags. Each hyperedge represents a question on stackoverflow.com and contains the set of tags applied to that question. Timestamps are recorded at millisecond resolution and normalized to start at 0.
Basic statistics
- Nodes: 49998
- Hyperedges: 14458875
- Unique hyperedges: 5537637
- Max size hyperedge: 5
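
To make the difference between "Hyperedges" and "Unique hyperedges" concrete, the following is a minimal sketch of how such counts could be derived from a list of tag sets. It assumes a hyperedge is "unique" by its set of tags alone, regardless of order or timestamp; the function name `basic_statistics` and the toy data are hypothetical and are not part of the dataset or its tooling.

```python
def basic_statistics(hyperedges):
    """Compute summary counts for a list of hyperedges (each a list of tags)."""
    nodes = set()
    unique = set()
    max_size = 0
    for edge in hyperedges:
        members = frozenset(edge)  # tag order within a question is ignored
        nodes.update(members)
        unique.add(members)
        max_size = max(max_size, len(members))
    return {
        "nodes": len(nodes),
        "hyperedges": len(hyperedges),
        "unique_hyperedges": len(unique),
        "max_size": max_size,
    }


# Toy input: four questions, two of which carry the same tag set.
toy = [
    ["python", "pandas"],
    ["pandas", "python"],
    ["c#", ".net", "linq"],
    ["python"],
]
stats = basic_statistics(toy)
print(stats)
```

In the toy input the first two questions share a tag set, so they count as two hyperedges but only one unique hyperedge.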
Hypergraph metadata
| Property | Description |
|---|---|
| name | (STRING) Dataset name (e.g., tags-stack-overflow). |
| type | (STRING) Hypergraph type (e.g., TemporalHypergraph). |
| version | (STRING) Dataset version (e.g., 1.0.0). |
| weighted | (BOOL) Whether the hypergraph is weighted (e.g., true). |
Node metadata
| Property | Description |
|---|---|
| tag | (STRING) Stack Overflow tag represented by the node (e.g., c#). |
Hyperedge metadata
| Property | Description |
|---|---|
| original_timestamp | (INT) Original question timestamp (e.g., 0). |
| time | (INT) Normalized question timestamp used by the temporal hypergraph (e.g., 0). |
| weight | (INT) Number of questions collapsed into this tag-set/time hyperedge (e.g., 1). |
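
The relationship between the three hyperedge properties can be illustrated with a short sketch. It assumes that `time` is obtained by subtracting the earliest `original_timestamp`, and that `weight` counts questions sharing both the same tag set and the same timestamp. The function name `collapse_questions`, the `nodes` key, and the toy data are hypothetical; the actual preprocessing is described in the reproducibility scripts and may differ.

```python
from collections import Counter


def collapse_questions(questions):
    """Collapse (tags, original_timestamp) pairs into weighted hyperedge records."""
    questions = list(questions)
    if not questions:
        return []
    # Assumption: normalized time is the offset from the earliest timestamp.
    t0 = min(ts for _, ts in questions)
    # Questions with an identical tag set and timestamp become one hyperedge.
    counts = Counter((frozenset(tags), ts) for tags, ts in questions)
    records = [
        {
            "nodes": sorted(tags),
            "original_timestamp": ts,
            "time": ts - t0,
            "weight": weight,
        }
        for (tags, ts), weight in counts.items()
    ]
    records.sort(key=lambda r: (r["time"], r["nodes"]))
    return records


# Toy input: two questions share a tag set and timestamp, one does not.
toy_questions = [
    (["python", "pandas"], 1000),
    (["pandas", "python"], 1000),
    (["c#"], 1500),
]
records = collapse_questions(toy_questions)
for record in records:
    print(record)
```

Under these assumptions, the first two toy questions collapse into a single hyperedge with `weight` 2 and `time` 0, and the third becomes a hyperedge with `weight` 1 and `time` 500.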
Hyperedge size distribution
Hyperdegree distribution
Download
- Version 1.0.0: Binary (452.5 MB), JSON (210.3 MB)
Provenance
Source: https://www.cs.cornell.edu/~arb/data/tags-stack-overflow/
License: Not specified. Please refer to the original source for licensing terms.
Reproducibility: Instructions and scripts
Citation
When this data is used in published research or for visualization purposes, please cite the following:
@article{benson2018simplicial,
author = {Benson, Austin R. and Abebe, Rediet and Schaub, Michael T. and Jadbabaie, Ali and Kleinberg, Jon},
title = {Simplicial closure and higher-order link prediction},
year = {2018},
doi = {10.1073/pnas.1800683115},
publisher = {National Academy of Sciences},
issn = {0027-8424},
journal = {Proceedings of the National Academy of Sciences}
}