
Yavin4

Yavin4's Journal
November 7, 2016

What you need to know about email de-duplication

I work in a field called Electronic Discovery, and we deal with processing, reviewing, and producing emails all of the time. De-duplication of emails has been around for years. There are several existing applications that can quickly de-duplicate an email set. The process is described below:

Yes. De-duplication is usually performed by comparing cryptographic hashes (e.g. MD5, SHA1 etc.) of documents to each other. The calculated hash values are based on the binary contents of documents and do not take into account external metadata that is stored in the file system. Therefore, two files with the same contents but different file names would produce the same hash value.
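To make that concrete, here is a minimal sketch of content-hash de-duplication in Python. It is only an illustration under my own assumptions, not how any particular e-Discovery product works: the function names, the sample file names, and the sample messages are all made up, and SHA-1 is used simply because it is one of the hashes named above.

```python
import hashlib


def content_hash(data: bytes) -> str:
    """Return the SHA-1 hex digest of a document's raw bytes."""
    return hashlib.sha1(data).hexdigest()


def deduplicate(documents):
    """Keep the first document seen for each distinct content hash.

    `documents` is an iterable of (name, data) pairs. The name is not
    hashed, so a renamed copy of the same bytes counts as a duplicate.
    """
    seen = set()
    unique = []
    for name, data in documents:
        digest = content_hash(data)
        if digest not in seen:
            seen.add(digest)
            unique.append((name, data))
    return unique


# Hypothetical sample set: the second item is a renamed copy of the first.
emails = [
    ("inbox/msg1.eml", b"Subject: lunch\n\nNoon works."),
    ("archive/copy_of_msg1.eml", b"Subject: lunch\n\nNoon works."),
    ("inbox/msg2.eml", b"Subject: dinner\n\nSeven works."),
]
unique_emails = deduplicate(emails)
```

Each document is read and hashed once, and spotting a repeat is just a set lookup, which is why a large set can be de-duplicated quickly.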

Most e-Discovery service providers would allow you to use a custom hash that includes your choice of metadata fields in addition to document contents for de-duplication. For example, you could choose to include the file name field in your custom hash if you would like to make sure that documents can be considered duplicates only when their file names are also identical.
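A custom hash along those lines might look like the sketch below. Again, this is an assumption-laden illustration in Python rather than any vendor's actual method: `custom_hash`, the `file_name` field, and the sample metadata are hypothetical names I picked for the example.

```python
import hashlib


def custom_hash(data: bytes, metadata: dict, fields=()) -> str:
    """Hash document contents plus the chosen metadata fields.

    With no fields selected this is a hash of the contents alone.
    Each selected field is length-prefixed so that different field
    values cannot run together and produce the same input stream.
    """
    hasher = hashlib.sha1()
    for field in sorted(fields):
        entry = f"{field}={metadata.get(field, '')}".encode("utf-8")
        hasher.update(len(entry).to_bytes(8, "big"))
        hasher.update(entry)
    hasher.update(data)
    return hasher.hexdigest()


# Hypothetical documents: identical contents, different file names.
body = b"Subject: lunch\n\nNoon works."
meta_a = {"file_name": "msg1.eml"}
meta_b = {"file_name": "copy_of_msg1.eml"}

# Contents only: the two documents are duplicates.
same_without_name = custom_hash(body, meta_a) == custom_hash(body, meta_b)

# Contents plus file name: they are no longer duplicates.
same_with_name = (
    custom_hash(body, meta_a, fields=("file_name",))
    == custom_hash(body, meta_b, fields=("file_name",))
)
```

Adding fields makes the match stricter: fewer documents are treated as duplicates, so more survive for review.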


You can read more about it here:

http://www.meridiandiscovery.com/articles/frequently-asked-questions-about-de-duplication/

--OnEdit--

I'm posting this so that you will have reference material to counter your right-wing relatives at Thanksgiving when they go off about a "rigged" system and claim that you cannot review 650,000 emails.

November 2, 2016

Anyone remember the site, Hillaryis44.com from 2008?

Well, their true colors have been revealed. They endorsed Trump. I shit you not.

November 2nd, 2016
HillaryIs44 Officially Endorses @RealDonaldTrump For President
Since 2007 we have been the preeminent Hillary Clinton support site. For years we called ourselves a “pink aircraft carrier” willing, able and ready to defend and support Hillary Clinton. After 2008 we compared ourselves to soldiers in “winter quarters” willing, able, and ready-for-Hillary the moment she declared herself a candidate. Today we endorse Donald J. Trump for president.

Our readers deserve a fuller explanation as to why we, who have supported Hillary Clinton for so long, now endorse fully and urge every American to vote for Donald J. Trump. We will begin with a brief revisit of our history in order to make clear our support for Hillary Clinton has been unquestioned. We will then make the case for Donald J. Trump.


http://hillaryis44.com/

November 2, 2016

Looks like it's Game 7

Only appropriate.

October 28, 2016

Is there an NFL game on tonight?

Who's playing?

October 28, 2016

"Hillary just tried to kill Mike Pence in order to get a leg up in 2020!"

How long before we see that theory floating around?

October 27, 2016

Milo the dog steals a sausage. A good attorney would get him off.

I love how he checks the plate afterwards, just in case it was some kind of magic plate where sausages just appear:

October 25, 2016

Browns' Cody Kessler, Josh McCown and Kevin Hogan are all possibilities to start against the Jets

The question is: Does it really matter? The answer is: No.

October 25, 2016

The only way to control soaring healthcare costs is to have single payer

However, Republicans, some Dems, and a lot of people who get employer-provided insurance won't support it.

Profile Information

Member since: 2001
Number of posts: 35,438