Taming Text

It is no secret that the world is drowning in text and data. This causes real problems for everyday users who need to make sense of all the information available, and for software engineers who want to make their text-based applications more useful and user-friendly. Whether you’re building a search engine for a corporate website, automatically organizing email, or extracting important nuggets of information from the news, dealing with unstructured text can be a daunting task.

Taming Text is a hands-on, example-driven guide to working with unstructured text in the context of real-world applications. This book explores how to automatically organize text using approaches such as full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. The book guides you through examples illustrating each of these topics, as well as the foundations upon which they are built.
