Please enter keyword
Please enter keyword
Text Classification
The Enron Email Dataset is a large collection of email data from the Enron Corporation, made public by the Federal Energy Regulatory Commission during its investigation of the company. The dataset contains around 500,000 emails from about 150 users, primarily senior management. This dataset is widely used for research in natural language processing, machine learning, and social network analysis. It provides valuable insightscorporate communication patterns and is a key resource for developing and testing email classification, spam detection, and other email-related algorithms.

The Enron Email Dataset is a large collection of email data from the Enron Corporation, made public by the Federal Energy Regulatory Commission during its investigation of the company. The dataset contains around 500,000 emails from about 150 users, primarily senior management. This dataset is widely used for research in natural language processing, machine learning, and social network analysis. It provides valuable insightscorporate communication patterns and is a key resource for developing and testing email classification, spam detection, and other email-related algorithms.


The email dataset was later purchased by Leslie Kaelbling at MIT, and turned out to have a number of integrity problems. A number of folks at SRI, notably Melinda Gervasio, worked hard to correct these problems, and it is thanks to them (not me) that the dataset is available. The dataset here does not include attachments, and some messages have been deleted "as part of a redaction effort due to requests from affected employees". Invalid email addresses were converted to something of the form user@enron.com whenever possible (i.e., recipient is specified in some parse-able format like "Doe, John" or "Mary K. Smith") and to no_address@enron.com when no recipient was specified.


Prior versions of the dataset are no longer being distributed. If you are using the March 2, 2004 Version; the August 21, 2009 Version; or the April 2, 2011 Version of this dataset for your work, you are requested to replace it with the newer version of the dataset below, or make the the appropriate changes to your local copy. 


A Cognitive Assistant that Learns and Organizes

https://www.cs.cmu.edu/~./enron/





About GTI


GTI is an international cooperation platform initiated and established by China Mobile, Softbank, Vodafone and other operators in 2011. It currently has 146 operator members and 258 industry partners. In 2023, the new stage of GTI 3.0 was officially launched, and it is committed to continuously deepening the global cooperation of 5G-A+AI and achieving win-win commercial success. For more information, please visit http://gtigroup.org/


About the GTI 5G-A x AI Open Development Program


In Feb 2024 at MWC Barcelona, GTI launched the 5G-A×AI Development Program to promote the integrated innovation of 5G and AI in technology, business, ecology, and commerce, and two-way empowerment. Therefore, 5G is smarter and AI is more ubiquitous, which will support the goals of GTI 3.0. First, Build Open Labs to provide basic environment, equipment facilities, industry application scenarios and other resources for 5G-AxAI integration innovation, and carry out the R&D, testing and demonstration of new technologies and solutions. Second, Build an Open Collaborative Innovation Community, with an online platform for “Communication and Sharing” and “Supply and Demand Matching”, and jointly carry out cutting-edge exploration, technical research, testing and iterative optimization. Third, Explore Innovative 5G-AxAI Integration Use Cases, and condense themreplicable business model templates, so as to provide references for value creation and monetization.


About the China Mobile Research Institution Service Division

……

Back top