The corpus was generated from Enron email servers by the Federal Energy Regulatory Commission (FERC) during its subsequent investigation.
In the legal investigation into Enron's collapse, the discovery process required collecting and preserving vast amounts of data, for which the FERC hired Aspen Systems (now part of Lockheed Martin).
The emails were collected at Enron Corporation headquarters in Houston during two weeks in May 2002 by Joe Bartling,[4] a litigation support and data analysis contractor for Aspen.
Once collected, the Enron emails were processed and hosted in proprietary electronic discovery platforms (first Concordance, then iCONECT) for review by investigators from the FERC, Commodity Futures Trading Commission, and Department of Justice.
At the conclusion of the investigation, and upon the issuance of the FERC staff report,[6] the emails and information collected were deemed to be in the public domain, to be used for historical research and academic purposes.