Book description
Use easy-to-apply patterns in SQL and Python to adopt modern analytics engineering to build agile platforms with dbt that are well-tested and simple to extend and run Purchase of the print or Kindle book includes a free PDF eBook
Key Features
- Build a solid dbt base and learn data modeling and the modern data stack to become an analytics engineer
- Build automated and reliable pipelines to deploy, test, run, and monitor ELTs with dbt Cloud
- Guided dbt + Snowflake project to build a pattern-based architecture that delivers reliable datasets
Book Description
dbt Cloud helps professional analytics engineers automate the application of powerful and proven patterns to transform data from ingestion to delivery, enabling real DataOps.
This book begins by introducing you to dbt and its role in the data stack, along with how it uses simple SQL to build your data platform, helping you and your team work better together. You’ll find out how to leverage data modeling, data quality, master data management, and more to build a simple-to-understand and future-proof solution. As you advance, you’ll explore the modern data stack, understand how data-related careers are changing, and see how dbt enables this transition into the emerging role of an analytics engineer. The chapters help you build a sample project using the free version of dbt Cloud, Snowflake, and GitHub to create a professional DevOps setup with continuous integration, automated deployment, ELT run, scheduling, and monitoring, solving practical cases you encounter in your daily work.
By the end of this dbt book, you’ll be able to build an end-to-end pragmatic data platform by ingesting data exported from your source systems, coding the needed transformations, including master data and the desired business rules, and building well-formed dimensional models or wide tables that’ll enable you to build reports with the BI tool of your choice.
What you will learn
- Create a dbt Cloud account and understand the ELT workflow
- Combine Snowflake and dbt for building modern data engineering pipelines
- Use SQL to transform raw data into usable data, and test its accuracy
- Write dbt macros and use Jinja to apply software engineering principles
- Test data and transformations to ensure reliability and data quality
- Build a lightweight pragmatic data platform using proven patterns
- Write easy-to-maintain idempotent code using dbt materialization
Who this book is for
This book is for data engineers, analytics engineers, BI professionals, and data analysts who want to learn how to build simple, futureproof, and maintainable data platforms in an agile way. Project managers, data team managers, and decision makers looking to understand the importance of building a data platform and foster a culture of high-performing data teams will also find this book useful. Basic knowledge of SQL and data modeling will help you get the most out of the many layers of this book. The book also includes primers on many data-related subjects to help juniors get started.
Table of contents
- Data Engineering with dbt
- Contributors
- About the author
- About the reviewers
- Preface
- Part 1: The Foundations of Data Engineering
- Chapter 1: The Basics of SQL to Transform Data
- Chapter 2: Setting Up Your dbt Cloud Development Environment
- Chapter 3: Data Modeling for Data Engineering
- Chapter 4: Analytics Engineering as the New Core of Data Engineering
-
Chapter 5: Transforming Data with dbt
- Technical requirements
- The dbt Core workflow for ingesting and transforming data
- Introducing our stock tracking project
- Defining data sources and providing reference data
-
How to write and test transformations
- Writing the first dbt model
- Real-time lineage and project navigation
- Deploying the first dbt model
- Committing the first dbt model
- Configuring our project and where we store data
- Re-deploying our environment to the desired schema
- Configuring the layers for our architecture
- Ensuring data quality with tests
- Generating the documentation
- Summary
- Part 2: Agile Data Engineering with dbt
- Chapter 6: Writing Maintainable Code
-
Chapter 7: Working with Dimensional Data
- Adding dimensional data
- Loading the data of the first dimension
- Building the STG model for the first dimension
- Saving history for the dimensional data
- Building the REF layer with the dimensional data
- Adding the dimensional data to the data mart
- Exercise – adding a few more hand-maintained dimensions
- Summary
- Chapter 8: Delivering Consistency in Your Data
- Chapter 9: Delivering Reliability in Your Data
-
Chapter 10: Agile Development
- Technical requirements
- Agile development and collaboration
- Applying agile to data engineering
-
Building reports in an agile way
- S1 – designing a light data model for the data mart
- S2 – designing a light data model for the REF layer
- S3.x – developing with dbt models the pipeline for the XYZ table
- S4 – an acceptance test of the data produced in the data mart
- S5 – development and verification of the report in the BI application
- Summary
- Chapter 11: Team Collaboration
- Part 3: Hands-On Best Practices for Simple, Future-Proof Data Platforms
- Chapter 12: Deployment, Execution, and Documentation Automation
- Chapter 13: Moving Beyond the Basics
- Chapter 14: Enhancing Software Quality
- Chapter 15: Patterns for Frequent Use Cases
- Index
- Other Books You May Enjoy
Product information
- Title: Data Engineering with dbt
- Author(s):
- Release date: June 2023
- Publisher(s): Packt Publishing
- ISBN: 9781803246284
You might also like
book
Deciphering Data Architectures
Data fabric, data lakehouse, and data mesh have recently appeared as viable alternatives to the modern …
audiobook
Fundamentals of Data Engineering
Data engineering has grown rapidly in the past decade, leaving many software engineers, data scientists, and …
book
Fundamentals of Data Engineering
Data engineering has grown rapidly in the past decade, leaving many software engineers, data scientists, and …
book
Prompt Engineering for Generative AI
Large language models (LLMs) and diffusion models such as ChatGPT and Stable Diffusion have unprecedented potential. …