
DuckDB in Action
Manning Publications (Publisher)
Published on 26. August 2024
Book
Paperback/Softback
312 pages
978-1-63343-725-8 (ISBN)
Description
Dive into DuckDB and start processing gigabytes of data with ease-all with no data warehouse.
DuckDB is a cutting-edge SQL database that makes it incredibly easy to analyze big data sets right from your laptop. In DuckDB in Action you'll learn everything you need to know to get the most out of this awesome tool, keep your data secure on prem, and save you hundreds on your cloud bill. From data ingestion to advanced data pipelines, you'll learn everything you need to get the most out of DuckDB-all through hands-on examples.
Open up DuckDB in Action and learn how to:
Read and process data from CSV, JSON and Parquet sources both locally and remote
Write analytical SQL queries, including aggregations, common table expressions, window functions, special types of joins, and pivot tables
Use DuckDB from Python, both with SQL and its "Relational"-API, interacting with databases but also data frames
Prepare, ingest and query large datasets
Build cloud data pipelines
Extend DuckDB with custom functionality
Pragmatic and comprehensive, DuckDB in Action introduces the DuckDB database and shows you how to use it to solve common data workflow problems. You won't need to read through pages of documentation-you'll learn as you work. Get to grips with DuckDB's unique SQL dialect, learning to seamlessly load, prepare, and analyze data using SQL queries. Extend DuckDB with both Python and built-in tools such as MotherDuck, and gain practical insights into building robust and automated data pipelines. About the technology DuckDB makes data analytics fast and fun! You don't need to set up a Spark or run a cloud data warehouse just to process a few hundred gigabytes of data. DuckDB is easily embeddable in any data analytics application, runs on a laptop, and processes data from almost any source, including JSON, CSV, Parquet, SQLite and Postgres.
DuckDB is a cutting-edge SQL database that makes it incredibly easy to analyze big data sets right from your laptop. In DuckDB in Action you'll learn everything you need to know to get the most out of this awesome tool, keep your data secure on prem, and save you hundreds on your cloud bill. From data ingestion to advanced data pipelines, you'll learn everything you need to get the most out of DuckDB-all through hands-on examples.
Open up DuckDB in Action and learn how to:
Read and process data from CSV, JSON and Parquet sources both locally and remote
Write analytical SQL queries, including aggregations, common table expressions, window functions, special types of joins, and pivot tables
Use DuckDB from Python, both with SQL and its "Relational"-API, interacting with databases but also data frames
Prepare, ingest and query large datasets
Build cloud data pipelines
Extend DuckDB with custom functionality
Pragmatic and comprehensive, DuckDB in Action introduces the DuckDB database and shows you how to use it to solve common data workflow problems. You won't need to read through pages of documentation-you'll learn as you work. Get to grips with DuckDB's unique SQL dialect, learning to seamlessly load, prepare, and analyze data using SQL queries. Extend DuckDB with both Python and built-in tools such as MotherDuck, and gain practical insights into building robust and automated data pipelines. About the technology DuckDB makes data analytics fast and fun! You don't need to set up a Spark or run a cloud data warehouse just to process a few hundred gigabytes of data. DuckDB is easily embeddable in any data analytics application, runs on a laptop, and processes data from almost any source, including JSON, CSV, Parquet, SQLite and Postgres.
More details
Language
English
Place of publication
New York
United States
Target group
Professional and scholarly
Product notice
Paperback (trade)
Unsewn / adhesive bound
Illustrations
Illustrations
Dimensions
Height: 235 mm
Width: 186 mm
Thickness: 16 mm
Weight
548 gr
ISBN-13
978-1-63343-725-8 (9781633437258)
Copyright in bibliographic data and cover images is held by Nielsen Book Services Limited or by the publishers or by their respective licensors: all rights reserved.
Schweitzer Classification
Other editions
Additional editions

Mark Needham | Michael Hunger | Michael Simons
DuckDB in Action
E-Book
09/2024
1st Edition
Simon + Schuster LLC
€49.44
Available for download
Persons
Mark Needham is a blogger and video creator at @LearnDataWithMark. Michael Hunger leads product innovation for the Neo4j graph database. Michael Simons is a Java Champion, author, and Engineer at Neo4j.