You are hereBlogs / Frank Diana's blog / Text Analytics and Enterprise Content Management

Text Analytics and Enterprise Content Management


Frank Diana's picture
By Frank Diana - Posted on 07 August 2008

As text analytics gets more attention, one area that I would expect to see discussed more is Enterprise Content Management (ECM).

 

There is a considerable amount of value associated with the automated classification of content on the way into the content repository. With regard to content access, search has been the common mechanism for finding relevant documents within the repository. The application of text analytics however takes these solutions to another level. Rather than finding content, text analytics enables the extraction of insight from content. I see the marriage of ECM and text analytics as a natural evolution of ECM. 

 
ECM is obviously a source of text data, along with the web, call center applications, email, and others. In some instances, content repositories house data from all over the enterprise. It seems like a natural place to focus strategies and architectures aimed at dealing with unstructured data. Over the last couple of months, I’ve had the benefit of looking at IBM’s strategy in this area. They have integrated their soon to be released Content Analyzer (formerly called IBM OmniFind Analytics Edition) with FileNet – a widely deployed Enterprise Content Management platform. Another vendor moving in this direction is IxReveal, as their uReka and uReveal products have been used in concert with SharePoint
 
These integration examples usher in the next phase of ECM, and have the potential to turn the platform into a strategic asset. I imagine we’ll see similar strategies from the other major ECM vendors – and as I said at the beginning, much more discussion about extending current ECM investments.

Post new comment

The content of this field is kept private and will not be shown publicly.
  • Web page addresses and e-mail addresses turn into links automatically.
  • Allowed HTML tags: <a> <em> <strong> <cite> <code> <ul> <ol> <li> <dl> <dt> <dd>
  • Lines and paragraphs break automatically.

More information about formatting options