email-Enron

Description

Nodes represent email addresses. Each hyperedge represents an email message, connecting the sender with all recipients (and may include a timestamp/weight when available). This dataset contains the emails from the Enron Email Dataset.

Basic statistics

  • Nodes: 84172
  • Hyperedges: 235395
  • Unique hyperedges: 111558
  • Max size hyperedge: 892

Hyperedge size distribution

Hyperdegree distribution

Download

Related datasets

Provenance

Source: https://www.cs.cmu.edu/~enron/

License: Not specified. Please refer to the original source for licensing terms.

Reproducibility: Instructions and scripts

Citation

When this data is used in published research or for visualization purposes, please cite the following:

                    
                    Copied!
                    @InProceedings{klimt2004enron,
 author="Klimt, Bryan and Yang, Yiming",
 title="The Enron Corpus: A New Dataset for Email Classification Research",
 booktitle="Machine Learning: ECML 2004",
 year="2004",
 publisher="Springer Berlin Heidelberg",
 address="Berlin, Heidelberg",
 pages="217--226",
 isbn="978-3-540-30115-8"
}