
Architecting an Apache Iceberg Lakehouse
Alex Merced(Author)
Manning Publications (Publisher)
Published on 9. June 2026
Book
Paperback/Softback
408 pages
978-1-63343-510-0 (ISBN)
Description
Building a data platform can be difficult, and many systems are expensive to run. When you rely on one vendor, it can be hard to change or improve your setup later. A modern data platform should be reliable, able to grow with your needs, and not tied to costly licences. Open-source tools give you more control over how your platform works and how much you spend. This book shows a clear way to build a complete data platform using Apache Iceberg.
Learn how to create a modular and scalable Iceberg Lakehouse architecture.
Understand where Spark, Flink, Dremio, and Polaris fit into your design.
Build reliable batch and streaming ingestion pipelines for your data.
Apply strategies for governance, security, and performance at scale.
Connect architectural theory with practical implementation through hands-on examples.
Architecting an Apache Iceberg Lakehouse is a practical guide to designing a complete data platform. The book carefully guides you through each layer of the architecture, from storage and ingestion to governance and security. It uses hands-on examples, such as ingesting data with Apache Spark and building dashboards in Apache Superset, to illustrate key concepts.
After reading this book, you will understand how to design and build a data platform that can grow with your needs. It explains the key choices you need to make when working with large amounts of data in real projects.
This book is written for data architects who already know the basics of data Lakehouses and are planning a new system or moving from an existing one.
Learn how to create a modular and scalable Iceberg Lakehouse architecture.
Understand where Spark, Flink, Dremio, and Polaris fit into your design.
Build reliable batch and streaming ingestion pipelines for your data.
Apply strategies for governance, security, and performance at scale.
Connect architectural theory with practical implementation through hands-on examples.
Architecting an Apache Iceberg Lakehouse is a practical guide to designing a complete data platform. The book carefully guides you through each layer of the architecture, from storage and ingestion to governance and security. It uses hands-on examples, such as ingesting data with Apache Spark and building dashboards in Apache Superset, to illustrate key concepts.
After reading this book, you will understand how to design and build a data platform that can grow with your needs. It explains the key choices you need to make when working with large amounts of data in real projects.
This book is written for data architects who already know the basics of data Lakehouses and are planning a new system or moving from an existing one.
Reviews / Votes
"This book definitely plugs a very important gap, where there are not a lot of resources out there to help the reader build/migrate to a data platform. It also proposes an alternative architecture to cater to the growing data science and analytics use cases."Nirmal A, Senior SDE, Amazon
More details
Language
English
Place of publication
New York
United States
Target group
Professional and scholarly
Dimensions
Height: 236 mm
Width: 186 mm
Thickness: 22 mm
Weight
734 gr
ISBN-13
978-1-63343-510-0 (9781633435100)
Copyright in bibliographic data and cover images is held by Nielsen Book Services Limited or by the publishers or by their respective licensors: all rights reserved.
Schweitzer Classification
Person
Alex Merced is Head of Developer Relations at Dremio, known for helping developers navigate modern data architectures. He has extensive experience in creating helpful and practical content. Alex translates complex architectural concepts into actionable guidance that helps readers build effective data platforms.
Content
PART 1: THE VALUE OF THE APACHE ICEBERG LAKEHOUSE
1 THE WORLD OF THE APACHE ICEBERG LAKEHOUSE
2 HANDS-ON WITH APACHE ICEBERG
PART 2: DESIGNING YOUR ICEBERG ARCHITECTURE
3 PREPARING FOR YOUR MOVE TO APACHE ICEBERG
4 SELECTING THE STORAGE LAYER
5 ARCHITECTING THE INGESTION LAYER
6 IMPLEMENTING THE CATALOG LAYER
7 DESIGNING THE FEDERATION LAYER
8 UNDERSTANDING THE CONSUMPTION LAYER
PART 3: OPERATING YOUR APACHE ICEBERG LAKEHOUSE
9 MAINTAINING AN ICEBERG LAKEHOUSE
10 OPERATIONALIZING APACHE ICEBERG
APPENDIXES
APPENDIX A: THE METADATA TABLES
APPENDIX B: PYTHON FOR APACHE ICEBERG
APPENDIX C: THE APACHE ICEBERG SPECIFICATION
1 THE WORLD OF THE APACHE ICEBERG LAKEHOUSE
2 HANDS-ON WITH APACHE ICEBERG
PART 2: DESIGNING YOUR ICEBERG ARCHITECTURE
3 PREPARING FOR YOUR MOVE TO APACHE ICEBERG
4 SELECTING THE STORAGE LAYER
5 ARCHITECTING THE INGESTION LAYER
6 IMPLEMENTING THE CATALOG LAYER
7 DESIGNING THE FEDERATION LAYER
8 UNDERSTANDING THE CONSUMPTION LAYER
PART 3: OPERATING YOUR APACHE ICEBERG LAKEHOUSE
9 MAINTAINING AN ICEBERG LAKEHOUSE
10 OPERATIONALIZING APACHE ICEBERG
APPENDIXES
APPENDIX A: THE METADATA TABLES
APPENDIX B: PYTHON FOR APACHE ICEBERG
APPENDIX C: THE APACHE ICEBERG SPECIFICATION