The AIIM Blog - Overcoming Information Chaos

Content Analytics: How Do These New Tools Stack Up Against Search Engines?

Written by John Mancini | May 17, 2010 4:02:02 PM

The term "Content Analytics" has been coined to cover a range of advanced search and content reporting technologies. In our new Industry Watch survey, we found that organizations could derive much higher business value from content analytics tools than from simple search-engines.

Sophisticated content reporting across text documents and rich media file-types have created the opportunity to report and research across unstructured content, bringing the same capabilities of strategic insight and improved decision-making as Business Intelligence (BI) reporting brings to structured content.

Relatively new as a recognized toolset, content analytics tools provide trend analysis, content assessment, pattern recognition, and exception detection. In this report, we have explored the user-perceived limitations of conventional search and the potential savings that could be achieved by the application of content analytics to a number of business scenarios such as fraud detection, asset protection, healthcare research, and market monitoring.

Here are some of our key findings:

  • For 72% of respondents, it is harder to find information owned by their organization than information not owned by them – i.e., on the Web.

  • Of the 47% who find they frequently need to use Advanced Search options, more than half would like something more effective.

  • 70% would find advanced analytic functions “extremely useful” or “very useful.”

  • For most content types, our respondent’s ability to “research” is 3-6 times less than their ability to “search,” particularly for rich media files, but also office documents and emails.

  • E-discovery, Digital Asset Management (DAM), Web Analytics and De-duplication are the better-known technologies compared to Sentiment Analysis, Copyright Detection, and Digital Forensics.

  • There are strong plans to adopt DAM, Faceted Search, E-discovery, and Content Assessment in the next 18 months.

  • The biggest obstacle faced regarding content decommissioning is “not clear which content is valuable and which is not.” There is also considerable “Fear of the compliance and regulatory impact of deleting information.”

  • Only 15% have an automatic way of finding and deleting duplicates in their content stores, with just 8% able to analyze them automatically for relevancy and to delete irrelevant content.

  • 50% would find it of “high” or “very high” commercial value to be able to link a customer/citizen/ staff-member search across structured (database) data & unstructured documents & case notes. 44% would find it of “high” or “very high” commercial value to be able to automatically redact (blank out) sensitive information across forms, etc.

  • 81% of those who have digital assets to manage are not using a dedicated Digital Asset Management system, but 14% of our respondents are planning to implement one in the next 18 months. 48% store digital assets and rich media on ad hoc file shares.

  • 59% would find it of “high” or “very high” commercial value if they could use a faceted search across multiple metadata tags to cross-reference categories of rich media. 50% would find it of “high” or “very high” commercial value if they could detect unauthorized use of their assets across the web.

  • Net spending on Enterprise Search, Digital Asset Management and Content Analytics is set for a considerable increase in the next 12 months.