Book description
This concise yet comprehensive guide explains how to adopt a data lakehouse architecture to implement modern data platforms. It reviews the design considerations, challenges, and best practices for implementing a lakehouse and provides key insights into the ways that using a lakehouse can impact your data platform, from managing structured and unstructured data and supporting BI and AI/ML use cases to enabling more rigorous data governance and security measures.
Practical Lakehouse Architecture shows you how to:
- Understand key lakehouse concepts and features like transaction support, time travel, and schema evolution
- Understand the differences between traditional and lakehouse data architectures
- Differentiate between various file formats and table formats
- Design lakehouse architecture layers for storage, compute, metadata management, and data consumption
- Implement data governance and data security within the platform
- Evaluate technologies and decide on the best technology stack to implement the lakehouse for your use case
- Make critical design decisions and address practical challenges to build a future-ready data platform
- Start your lakehouse implementation journey and migrate data from existing systems to the lakehouse
Publisher resources
Table of contents
- Preface
- 1. Introduction to Lakehouse Architecture
- 2. Traditional Architectures and Modern Data Platforms
- 3. Storage: The Heart of the Lakehouse
- 4. Data Catalogs
- 5. Compute Engines for Lakehouse Architectures
- 6. Data (and AI) Governance and Security in Lakehouse Architecture
- 7. The Big Picture: Designing and Implementing a Lakehouse Platform
- 8. Lakehouse in the Real World
- 9. Lakehouse of the Future
- Index
- About the Author
Product information
- Title: Practical Lakehouse Architecture
- Author(s):
- Release date: July 2024
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781098153014
You might also like
book
Building LLM Powered Applications
Get hands-on with GPT 3.5, GPT 4, LangChain, Llama 2, Falcon LLM and more, to build …
book
Designing Data-Intensive Applications
Data is at the center of many challenges in system design today. Difficult issues need to …
book
Developing Apps with GPT-4 and ChatGPT
This minibook is a comprehensive guide for Python developers who want to learn how to build …
audiobook
Software Architecture: The Hard Parts
There are no easy decisions in software architecture. Instead, there are many hard parts-difficult problems or …