
Natural Language Processing with Java -
Bostjan Kaluza(Author)
Packt Publishing
2nd Edition
Published on 30. March 2018
Book
Paperback/Softback
407 pages
978-1-78728-807-2 (ISBN)
Description
Explore various approaches to organize and extract useful text from unstructured data using Java
About This Book
* Understand data hidden in text using the power of Java and natural language processing
* Find data, patterns, and gain interesting insights from language using this easy-to-follow book
* Get all the information to get up and running with natural language processing using this example-rich guide
Who This Book Is For
This book appeals to data analysts, or data science professionals who are looking to extract information from language using Java. Previous experience with Java and statistics is expected.
What You Will Learn
* Develop a deep understanding of the basic NLP tasks and how they relate to each other
* Discover and use the available tokenization engines
* Implement techniques for end of sentence detection
* Apply search techniques to find people and things within a document
* Construct solutions to identify parts of speech within sentences
* Use parsers to extract relationships between elements of a document
* Identify topics in a set of documents
* Integrate basic tasks to tackle more complex NLP problems
In Detail
Natural language processing allows taking any sentence and identify patterns, soecial names, company names, and such parts. This book will teach how to perform language analysis with the help of amazing libraries in Java and get insights from this analysis.You will start off with understanding how natural language processing works and various concepts in it. Followed by this, you will learn about important tools and libraries in Java for NLP. After this, you will directly dive into performing natural language processing on different inputs. You will learn things such as tokenization, finind entities, model training, parts of speech, parsing trees,a nd more. You will also learn about machine learning and corpus based methods and algorithms. Also, you will learn about statistical machine translation, summarization, dialog systems, complex searches, supervised and unsupervised NLP, and more.
About This Book
* Understand data hidden in text using the power of Java and natural language processing
* Find data, patterns, and gain interesting insights from language using this easy-to-follow book
* Get all the information to get up and running with natural language processing using this example-rich guide
Who This Book Is For
This book appeals to data analysts, or data science professionals who are looking to extract information from language using Java. Previous experience with Java and statistics is expected.
What You Will Learn
* Develop a deep understanding of the basic NLP tasks and how they relate to each other
* Discover and use the available tokenization engines
* Implement techniques for end of sentence detection
* Apply search techniques to find people and things within a document
* Construct solutions to identify parts of speech within sentences
* Use parsers to extract relationships between elements of a document
* Identify topics in a set of documents
* Integrate basic tasks to tackle more complex NLP problems
In Detail
Natural language processing allows taking any sentence and identify patterns, soecial names, company names, and such parts. This book will teach how to perform language analysis with the help of amazing libraries in Java and get insights from this analysis.You will start off with understanding how natural language processing works and various concepts in it. Followed by this, you will learn about important tools and libraries in Java for NLP. After this, you will directly dive into performing natural language processing on different inputs. You will learn things such as tokenization, finind entities, model training, parts of speech, parsing trees,a nd more. You will also learn about machine learning and corpus based methods and algorithms. Also, you will learn about statistical machine translation, summarization, dialog systems, complex searches, supervised and unsupervised NLP, and more.
More details
Edition
2nd Revised edition
Language
English
Place of publication
Birmingham
United Kingdom
Target group
Professional and scholarly
Edition type
Revised edition
Dimensions
Height: 235 mm
Width: 191 mm
ISBN-13
978-1-78728-807-2 (9781787288072)
Copyright in bibliographic data and cover images is held by Nielsen Book Services Limited or by the publishers or by their respective licensors: all rights reserved.
Schweitzer Classification
Person
Bostjan Kaluza, PhD, is a researcher in artificial intelligence and machine learning. Bostjan is the chief data scientist at Evolven, a leading IT operations analytics company, focusing on configuration and change management. He works with machine learning, predictive analytics, pattern mining, and anomaly detection to turn data into understandable relevant information and actionable insight. Prior to Evolven, Bostjan served as a senior researcher in the department of intelligent systems at the Jozef Stefan Institute, a leading Slovenian scientific research institution, and led research projects involving pattern and anomaly detection, ubiquitous computing, and multi-agent systems. Bostjan was also a visiting researcher at the University of Southern California, where he studied suspicious and anomalous agent behavior in the context of security applications. Bostjan has extensive experience in Java and Python, and he also lectures on Weka in the classroom. Focusing on machine learning and data science, Bostjan has published numerous articles in professional journals, delivered conference papers, and authored or contributed to a number of patents. In 2013, Bostjan published his first book on data science, Instant Weka How-to, Packt Publishing, exploring how to leverage machine learning using Weka.