amazon-reviews

Description

Hypergraph built from Amazon product reviews (specifically, the collection of 5-core datasets). Nodes represent products and each hyperedge represents the set of products reviewed by a user. Nodes are labeled by product category.
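As a rough illustration of the construction described above, the sketch below groups review records by user so that each user contributes one hyperedge over product nodes. The record layout (`(user_id, product_id)` pairs), the toy data, and the helper name `build_hyperedges` are assumptions made for this example; they are not the format or tooling of the original dataset.

```python
from collections import defaultdict

# Hypothetical toy review records: (user_id, product_id) pairs.
# The real dataset's file format is not described here.
reviews = [
    ("u1", "p1"), ("u1", "p2"), ("u1", "p2"),  # repeated review of p2
    ("u2", "p2"), ("u2", "p3"),
    ("u3", "p1"),
]

def build_hyperedges(pairs):
    """Return a dict mapping each user to the set of products they reviewed."""
    by_user = defaultdict(set)
    for user, product in pairs:
        by_user[user].add(product)  # sets drop repeated reviews of a product
    return {user: frozenset(products) for user, products in by_user.items()}

hyperedges = build_hyperedges(reviews)

# Nodes are the products that appear in at least one hyperedge.
nodes = set().union(*hyperedges.values())
```

Using sets means a user who reviews the same product twice still contributes that product only once to their hyperedge.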

Basic statistics

  • Nodes: 2268231
  • Hyperedges: 4242421
  • Unique hyperedges: 4242421
  • Maximum hyperedge size: 9350
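The statistics listed above could be recomputed from a list of hyperedges along the following lines. This is a minimal sketch on made-up data; the function name `basic_statistics` and the in-memory representation (an iterable of product-ID collections) are assumptions, not part of the dataset's published scripts.

```python
def basic_statistics(hyperedges):
    """Compute node, hyperedge, unique-hyperedge counts and the largest size."""
    edges = [frozenset(e) for e in hyperedges]
    nodes = set().union(*edges) if edges else set()
    return {
        "nodes": len(nodes),
        "hyperedges": len(edges),
        # Two hyperedges are duplicates when they contain the same node set.
        "unique_hyperedges": len(set(edges)),
        "max_hyperedge_size": max((len(e) for e in edges), default=0),
    }

# Hypothetical toy hypergraph; the first and third hyperedges are duplicates.
toy = [{"p1", "p2"}, {"p2", "p3", "p4"}, {"p2", "p1"}]
stats = basic_statistics(toy)
```

In the toy input, two hyperedges coincide, so the unique count is lower than the total; in the statistics reported above the two counts are equal.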

Hyperedge size distribution

Hyperdegree distribution
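For readers who want to reproduce the two distributions named in the headings above, the sketch below shows one way to compute them. It assumes the usual definitions: the size of a hyperedge is the number of nodes it contains, and the hyperdegree of a node is the number of hyperedges containing it. The function names and toy data are hypothetical.

```python
from collections import Counter

def size_distribution(hyperedges):
    """Map hyperedge size -> number of hyperedges with that size."""
    return Counter(len(set(e)) for e in hyperedges)

def hyperdegree_distribution(hyperedges):
    """Map hyperdegree -> number of nodes with that hyperdegree."""
    degree = Counter()
    for e in hyperedges:
        degree.update(set(e))  # each hyperedge counts once per node it contains
    return Counter(degree.values())

# Hypothetical toy hypergraph over product IDs.
toy = [{"p1", "p2"}, {"p2", "p3", "p4"}, {"p1"}]
sizes = size_distribution(toy)
degrees = hyperdegree_distribution(toy)
```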

Related datasets

Provenance

Source: https://www.cs.cornell.edu/~arb/data/amazon-reviews/

License: Not specified. Please refer to the original source for licensing terms.

Reproducibility: Instructions and scripts

Citation

When this data is used in published research or for visualization purposes, please cite the following:

                    
@inproceedings{ni2019justifying,
  title = {Justifying Recommendations using Distantly-Labeled Reviews and Fine-Grained Aspects},
  author = {Ni, Jianmo and Li, Jiacheng and McAuley, Julian},
  booktitle = {Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)},
  pages = {188--197},
  year = {2019},
  doi = {10.18653/v1/D19-1018},
  url = {https://doi.org/10.18653/v1/D19-1018}
}