Dumpster Diving and Content Optimization
Search and content analytics are very closely tied. A term coined by Gartner is ‘dumpster diving.’ Sort of brings a smile to your face. That’s what a lot of folks are trying to do.
We are a great proponent of content optimization. Quite simply, it eliminates dumpster diving, and consists of cleaning up the redundant, obsolete, trivial (ROT) in your content before migration, to improve search, and to perform content analytics.
Most organizations don’t practice content optimization, or do it in a half-hearted way. No, I take that back, most organizations don’t clean up their content on a regular basis, let alone have processes on how to keep their content clear of garbage.
Why is this so? End users are hoarders. You don’t want to get rid of something because, well, you just may need a document that is five years old sometime. Some business units start to panic when they think something is to be deleted – gone forever.
According to the Compliance, Governance, and Oversight Council (CGOC), 69% of content can and should be deleted. Migration is known to be fraught with problems. The potential for exposure of sensitive information, migration of multiple copies of the same document, increased costs, inefficient use of resources, and unknown legal exposures, are just a few potential issues.
In most cases, documents all get migrated anyway. Why on earth? Messes up search even more than it’s already messed up, and increases the chance of a data breach.
We have a client who took content optimization seriously. And I mean seriously. Over 20 million documents were indexed using conceptClassifier for Office 365, allowing the company to decommission its 50 plus server on-premises search environment to just five. Now that’s content optimization.
Back to dumpster diving. I hope I have impressed upon you the importance of intelligent migration and content optimization. But how does it eliminate dumpster diving? Your data set becomes much smaller and relevant.
Why waste hours cleaning up information that is extraneous, duplicate, old, or redundant, when you can start with a clean slate and search for exactly and only what you’re are looking for. Eliminates a lot of time and frustration.