The Enron v1 data set previously hosted by EDRM (www.edrm.net) has served for many years as an industry-standard collection of email data for electronic discovery training and testing. Since this data set was originally made available by FERC, it has been an open secret that it contained many instances of private, health and financial data about the company’s former employees.
Nuix specialists cleansed the EDRM Enron data set of private information. We identified and removed more than 10,000 items of information including:
Many items contained multiple instances and types of information. This included departmental contact list spreadsheets with dates of birth, credit card numbers, Social Security numbers, home addresses and other private details of dozens of staff members.
In removing these items and making the cleansed data set available to the community, we hope to protect the privacy of hundreds of individuals.
Nuix is also pleased to offer the legal and investigator community the methodology we used for identifying personal and financial data in corporate data sets.
Although the EDRM Enron data set is more than 10 years old, most organizations still face significant risks relating to private information stored in their systems.
These files may contain personally identifiable information, in spite of efforts to remove that information. If you find PII that you think should be removed, please notify us at firstname.lastname@example.org.