amazon-reviews
Description
Hypergraph built from Amazon product reviews (specifically, the collection of 5-core datasets). Nodes represent products and each hyperedge represents the set of products reviewed by a user. Nodes are labeled by product category.
Basic statistics
- Nodes: 2268231
- Hyperedges: 4242421
- Unique hyperedges: 4242421
- Max size hyperedge: 9350
Hypergraph metadata
| Property | Description |
|---|---|
| type | (STRING) Hypergraph type (e.g., Hypergraph). |
| weighted | (BOOL) Whether the hypergraph is weighted (e.g., false). |
Node metadata
| Property | Description |
|---|---|
| label | (LIST[STRING]) Product category label (e.g., [Clothing_Shoes_and_Jewelry]). |
Hyperedge size distribution
Hyperdegree distribution
Download
- Version 1.0.0 Binary (518.0 MB) JSON (270.1 MB)
Provenance
Source: https://www.cs.cornell.edu/~arb/data/amazon-reviews/
License: Not specified. Please refer to the original source for licensing terms.
Reproducibility: Instructions and scripts
Citation
When this data is used in published research or for visualization purposes, please cite the following:
Copied!
@inproceedings{ni2019justifying,
title = {Justifying Recommendations using Distantly-Labeled Reviews and Fine-Grained Aspects},
author = {Ni, Jianmo and Li, Jiacheng and McAuley, Julian},
booktitle = {Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)},
pages = {188--197},
year = {2019},
doi = {10.18653/v1/D19-1018},
url = {https://doi.org/10.18653/v1/D19-1018}
}