Taming Text

- How to Find, Organize, and Manipulate It

  • Format
  • Bog, paperback
  • Engelsk

Beskrivelse

Summary"Taming Text," winner of the 2013 Jolt Awards for Productivity, is a hands-on, example-driven guide to working with unstructured text in the context of real-world applications. This book explores how to automatically organize text using approaches such as full-text search, proper name recognition, clustering, tagging, information extraction, and summarization. The book guides you through examples illustrating each of these topics, as well as the foundations upon which they are built.About this BookThere is so much text in our lives, we are practically drowningin it. Fortunately, there are innovative tools and techniquesfor managing unstructured information that can throw thesmart developer a much-needed lifeline. You'll find them in thisbook."Taming Text" is a practical, example-driven guide to working withtext in real applications. This book introduces you to useful techniques like full-text search, proper name recognition, clustering, tagging, information extraction, and summarization.You'll explore real use cases as you systematically absorb thefoundations upon which they are built.Written in a clear and concise style, this book avoids jargon, explainingthe subject in terms you can understand without a backgroundin statistics or natural language processing. Examples arein Java, but the concepts can be applied in any language.Written for Java developers, the book requires no prior knowledge of GWT. Purchase of the print book comes with an offer of a free PDF, ePub, and Kindle eBook from Manning. Also available is all code from the book. Winner of 2013 Jolt Awards: The Best Books one of five notable books every serious programmer should read.What's InsideWhen to use text-taming techniquesImportant open-source libraries like Solr and MahoutHow to build text-processing applicationsAbout the AuthorsGrant Ingersoll is an engineer, speaker, and trainer, a Lucenecommitter, and a cofounder of the Mahout machine-learning project. Thomas Morton is the primary developer of OpenNLP and Maximum Entropy. Drew Farris is a technology consultant, software developer, and contributor to Mahout, Lucene, and Solr.""Takes the mystery out of verycomplex processes."" From the Foreword by Liz Liddy, Dean, iSchool, Syracuse UniversityTable of ContentsGetting started taming textFoundations of taming textSearchingFuzzy string matchingIdentifying people, places, and thingsClustering textClassification, categorization, and taggingBuilding an example question answering systemUntamed text: exploring the next frontier "

Læs hele beskrivelsen
Detaljer
  • SprogEngelsk
  • Sidetal298
  • Udgivelsesdato24-01-2013
  • ISBN139781933988382
  • Forlag Manning Publications
  • FormatPaperback
Størrelse og vægt
  • Vægt535 g
  • Dybde1,7 cm
  • coffee cup img
    10 cm
    book img
    18,7 cm
    23,4 cm

    Findes i disse kategorier...

    Se andre, der handler om...

    Velkommen til Saxo – din danske boghandel

    Hos os kan du handle som gæst, Saxo-bruger eller Saxo-medlem – du bestemmer selv. Skulle du få brug for hjælp, sidder vores kundeservice-team klar ved både telefonerne og tasterne.

    Om medlemspriser hos Saxo

    For at købe bøger til medlemspris skal du være medlem af Saxo Premium, Saxo Shopping eller Saxo Ung. De første 7 dage er gratis for nye medlemmer. Medlemskabet fornyes automatisk og kan altid opsiges. Læs mere om fordelene ved vores forskellige medlemskaber her.

    Machine Name: SAXO080