
Spark in Action, Second Edition
Jean-Georges Perrin(Author)
Manning Publications (Publisher)
2nd Edition
Published on 22. June 2020
Book
Paperback/Softback
576 pages
978-1-61729-552-2 (ISBN)
Shipment within 10-20 days
Description
The Spark distributed data processing platform provides an easy-to-implement tool for ingesting, streaming, and processing data from any source. In Spark in Action, Second Edition, you'll learn to take advantage of Spark's core features and incredible processing speed, with applications including real-time computation, delayed evaluation, and machine learning.
Unlike many Spark books written for data scientists, Spark in Action, Second Edition is designed for data engineers and software engineers who want to master data processing using Spark without having to learn a complex new ecosystem of languages and tools. You'll instead learn to apply your existing Java and SQL skills to take on practical, real-world challenges.
Key Features
? Lots of examples based in the Spark Java APIs using real-life dataset and scenarios
? Examples based on Spark v2.3 Ingestion through files, databases, and streaming
? Building custom ingestion process
? Querying distributed datasets with Spark SQL
For beginning to intermediate developers and data engineers comfortable programming in Java. No experience with functional programming, Scala, Spark, Hadoop, or big data is required.
About the technology
Spark is a powerful general-purpose analytics engine that can handle massive amounts of data distributed across clusters with thousands of servers. Optimized to run in memory, this impressive framework can process data up to 100x faster than most Hadoop-based systems.
Author Bio
An experienced consultant and entrepreneur passionate about all things data, Jean-Georges Perrin was the first IBM Champion in France, an honor he's now held for ten consecutive years. Jean-Georges has managed many teams of software and data engineers.
Unlike many Spark books written for data scientists, Spark in Action, Second Edition is designed for data engineers and software engineers who want to master data processing using Spark without having to learn a complex new ecosystem of languages and tools. You'll instead learn to apply your existing Java and SQL skills to take on practical, real-world challenges.
Key Features
? Lots of examples based in the Spark Java APIs using real-life dataset and scenarios
? Examples based on Spark v2.3 Ingestion through files, databases, and streaming
? Building custom ingestion process
? Querying distributed datasets with Spark SQL
For beginning to intermediate developers and data engineers comfortable programming in Java. No experience with functional programming, Scala, Spark, Hadoop, or big data is required.
About the technology
Spark is a powerful general-purpose analytics engine that can handle massive amounts of data distributed across clusters with thousands of servers. Optimized to run in memory, this impressive framework can process data up to 100x faster than most Hadoop-based systems.
Author Bio
An experienced consultant and entrepreneur passionate about all things data, Jean-Georges Perrin was the first IBM Champion in France, an honor he's now held for ten consecutive years. Jean-Georges has managed many teams of software and data engineers.
More details
Edition
2nd edition
Language
English
Place of publication
New York
United States
Target group
Professional and scholarly
Dimensions
Height: 233 mm
Width: 187 mm
Thickness: 31 mm
Weight
1066 gr
ISBN-13
978-1-61729-552-2 (9781617295522)
Copyright in bibliographic data and cover images is held by Nielsen Book Services Limited or by the publishers or by their respective licensors: all rights reserved.
Schweitzer Classification
Other editions
New editions

Vladimir Khorikov
Unit Testing:Principles, Practices and Patterns
Effective Testing Styles, Patterns, and Reliable Automation for Unit Testing, Mocking, and Integration Testing with Examples in C
Book
03/2020
Manning Publications
€47.00
Shipment within 10-20 days
Additional editions

E-Book
05/2020
1st Edition
Manning
€49.44
Available for download
Person
An experienced consultant and entrepreneur passionate about all things data, Jean-Georges Perrin was the first IBM Champion in France, an honor he's now held for ten consecutive years. Jean-Georges has managed many teams of software and data engineers.