Three files documenting the EDRM Enron Email Data Set v2 are now available. They are:
- edrm-enron-v2_pst-md5.txt: This file includes a listing of the PST ZIP files and the PST files themselves along with MD5 hashes
- edrm-enron-v2_xml-md5.txt: This file includes a listing of the XML ZIP files along with MD5 hashes
- edrm-enron-v2_dataset-info-md5.json: This JSON file includes information on the dataset including file counts, document counts, files by custodian, and MD5 hashes for PST ZIP, PST, and XML ZIP files
A complete list of EDRM Data Set downloads is here.












This set consists of Enron e-mail messages and attachments in zipped files. The files are organized by custodian. Each zipped file should contain some combination..
[...] This post was mentioned on Twitter by Peg Duncan and InfoGovernance, Orange LT. Orange LT said: E#DRM Enron Email Data Set v2 Information Files Available – http://tinyurl.com/25a9h5h [...]