
Natural Language Processing with Spark NLP
Learning to Understand Text at Scale
Alex Thomas(Author)
O'Reilly (Publisher)
Published on 10. July 2020
Book
Paperback/Softback
366 pages
978-1-4920-4776-6 (ISBN)
Description
If you want to build an enterprise-quality application that uses natural language text but aren't sure where to begin or what tools to use, this practical guide will help get you started. Alex Thomas, principal data scientist at Wisecube, shows software engineers and data scientists how to build scalable natural language processing (NLP) applications using deep learning and the Apache Spark NLP library.
Through concrete examples, practical and theoretical explanations, and hands-on exercises for using NLP on the Spark processing framework, this book teaches you everything from basic linguistics and writing systems to sentiment analysis and search engines. You'll also explore special concerns for developing text-based applications, such as performance.
In four sections, you'll learn NLP basics and building blocks before diving into application and system building:
Basics: Understand the fundamentals of natural language processing, NLP on Apache Stark, and deep learning
Building blocks: Learn techniques for building NLP applications-including tokenization, sentence segmentation, and named-entity recognition-and discover how and why they work
Applications: Explore the design, development, and experimentation process for building your own NLP applications
Building NLP systems: Consider options for productionizing and deploying NLP models, including which human languages to support
Through concrete examples, practical and theoretical explanations, and hands-on exercises for using NLP on the Spark processing framework, this book teaches you everything from basic linguistics and writing systems to sentiment analysis and search engines. You'll also explore special concerns for developing text-based applications, such as performance.
In four sections, you'll learn NLP basics and building blocks before diving into application and system building:
Basics: Understand the fundamentals of natural language processing, NLP on Apache Stark, and deep learning
Building blocks: Learn techniques for building NLP applications-including tokenization, sentence segmentation, and named-entity recognition-and discover how and why they work
Applications: Explore the design, development, and experimentation process for building your own NLP applications
Building NLP systems: Consider options for productionizing and deploying NLP models, including which human languages to support
More details
Language
English
Place of publication
Sebastopol
United States
Product notice
Paperback (trade)
Unsewn / adhesive bound
Dimensions
Height: 233 mm
Width: 178 mm
Thickness: 19 mm
Weight
581 gr
ISBN-13
978-1-4920-4776-6 (9781492047766)
Copyright in bibliographic data and cover images is held by Nielsen Book Services Limited or by the publishers or by their respective licensors: all rights reserved.
Schweitzer Classification
Other editions
Additional editions

E-Book
06/2020
O'Reilly
€50.49
Available for download

E-Book
06/2020
O'Reilly
€50.49
Available for download
Person
Alex Thomas is a data scientist at Indeed. He has used natural language processing (NLP) and machine learning with clinical data, identity data, and now employer and jobseeker data. He has worked with Apache Spark since version 0.9, and has worked with NLP libraries and frameworks including UIMA and OpenNLP.