Book description
Discover actionable steps to maintain healthy data pipelines to promote data observability within your teams with this essential guide to elevating data engineering practices
Key Features
- Learn how to monitor your data pipelines in a scalable way
- Apply real-life use cases and projects to gain hands-on experience in implementing data observability
- Instil trust in your pipelines among data producers and consumers alike
- Purchase of the print or Kindle book includes a free PDF eBook
Book Description
In the age of information, strategic management of data is critical to organizational success. The constant challenge lies in maintaining data accuracy and preventing data pipelines from breaking. Data Observability for Data Engineering is your definitive guide to implementing data observability successfully in your organization.
This book unveils the power of data observability, a fusion of techniques and methods that allow you to monitor and validate the health of your data. You’ll see how it builds on data quality monitoring and understand its significance from the data engineering perspective. Once you're familiar with the techniques and elements of data observability, you'll get hands-on with a practical Python project to reinforce what you've learned. Toward the end of the book, you’ll apply your expertise to explore diverse use cases and experiment with projects to seamlessly implement data observability in your organization.
Equipped with the mastery of data observability intricacies, you’ll be able to make your organization future-ready and resilient and never worry about the quality of your data pipelines again.
What you will learn
- Implement a data observability approach to enhance the quality of data pipelines
- Collect and analyze key metrics through coding examples
- Apply monkey patching in a Python module
- Manage the costs and risks associated with your data pipeline
- Understand the main techniques for collecting observability metrics
- Implement monitoring techniques for analytics pipelines in production
- Build and maintain a statistics engine continuously
Who this book is for
This book is for data engineers, data architects, data analysts, and data scientists who have encountered issues with broken data pipelines or dashboards. Organizations seeking to adopt data observability practices and managers responsible for data quality and processes will find this book especially useful to increase the confidence of data consumers and raise awareness among producers regarding their data pipelines.
Table of contents
- Data Observability for Data Engineering
- Contributors
- About the authors
- About the reviewers
- Preface
- Part 1: Introduction to Data Observability
- Chapter 1: Fundamentals of Data Quality Monitoring
- Chapter 2: Fundamentals of Data Observability
- Part 2: Implementing Data Observability
- Chapter 3: Data Observability Techniques
- Chapter 4: Data Observability Elements
- Chapter 5: Defining Rules on Indicators
- Part 3: How to adopt Data Observability in your organization
- Chapter 6: Root Cause Analysis
- Chapter 7: Optimizing Data Pipelines
- Chapter 8: Organizing Data Teams and Measuring the Success of Data Observability
- Part 4: Appendix
- Chapter 9: Data Observability Checklist
-
Chapter 10: Pathway to Data Observability
-
Technical roadmap to include data observability
- Allocating the right resources to your data observability project
- Defining clear objectives with the team
- Choosing a data pipeline
- Setting success criteria with the team and stakeholders
- Implementing data observability in applications
- Continuously improving observability
- Scaling data observability
- Using observability for data catalogs
- Using observability to ensure ML and AI reliability
- Using observability to complete a data quality management program
- Implementing data observability in a project
- Summary
-
Technical roadmap to include data observability
- Index
- Other Books You May Enjoy
Product information
- Title: Data Observability for Data Engineering
- Author(s):
- Release date: December 2023
- Publisher(s): Packt Publishing
- ISBN: 9781804616024
You might also like
audiobook
Fundamentals of Data Engineering
Data engineering has grown rapidly in the past decade, leaving many software engineers, data scientists, and …
book
Fundamentals of Data Engineering
Data engineering has grown rapidly in the past decade, leaving many software engineers, data scientists, and …
book
Fundamentals of Data Observability
Quickly detect, troubleshoot, and prevent a wide range of data issues through data observability, a set …
book
Data Science from Scratch, 2nd Edition
To really learn data science, you should not only master the tools—data science libraries, frameworks, modules, …