Smart Text: How to Turn Big Text into Big Data
Share this Session:
  Tom Reamy   Tom Reamy
Chief Knowledge Architect
KAPS Group
 


 

Thursday, August 21, 2014
11:15 AM - 11:45 AM

Level:  All - General Audience


Big Data continues to dominate the press and continues to deliver great new applications, but there is something much bigger and potentially much more valuable: Smart Text. Unstructured text is now estimated to constitute 90% of business information with the rise of social media, yet companies continue to struggle to get real business value from all that information. Text Analytics has the potential to realize that value through information and fact extraction, but needs to go beyond simple extraction and add more intelligence.

This talk, based on two case studies in industries as diverse as publishing and biopharma, looks at the techniques to add intelligence to extraction through disambiguation, distinguishing major from minor mentions (it’s often just as important to know when not to extract), and applying auto-categorization to provide more structured context. The ability to provide context not only improves the accuracy of extraction in cases that, for example, need to distinguish dates that are significant from random dates sprinkled through a document, or mentions of a company or organization where the company is the focus of the sentence or paragraph from mentions that are incidental. The result is the ability to intelligently extract from all that rich text and convert it into data to which all the latest big data techniques can be applied.


Tom Reamy is currently the Chief Knowledge Architect and founder of KAPS Group [http://www.kapsgroup.com], a group of knowledge architecture, text analytics, and taxonomy consultants. He has 20 years of experience in information projects of various kinds. He has published a number of articles in a variety of journals and is a frequent speaker at knowledge management, taxonomy, and text analytics conferences.

For the last 10 years, his primary focus has been on text analytics, helping clients select the best text analytics software, as well as doing text analytics development projects that include applications such as call support, voice of the customer, enterprise search, and content management.

When not writing or developing text analytics projects, he can usually be found at the bottom of the ocean in Carmel, photographing strange critters.


   
Close Window