Book description
Poor data quality can cause major problems for data teams, from breaking revenue-generating data pipelines to losing the trust of data consumers. Despite the importance of data quality, many data teams still struggle to avoid these issues—especially when their data is sourced from upstream workflows outside of their control. The solution: data contracts. Data contracts enable high-quality, well-governed data assets by documenting expectations of the data, establishing ownership of data assets, and then automatically enforcing these constraints within the CI/CD workflow.
This practical book introduces data contract architecture with a clear definition of data contracts, explains why the data industry needs them, and shares real-world use cases of data contracts in production. In addition, you'll learn how to implement components of the data contract architecture and understand how they're used in the data lifecycle. Finally, you'll build a case for implementing data contracts in your organization.
Authors Chad Sanderson and Mark Freeman will help you:
- Explore real-world applications of data contracts within the industry
- Understand how to apply each component of this architecture, such as CI/CD, monitoring, version control, and more
- Learn how to implement data contracts using open source tools
- Examine ways to resolve data quality issues using data contract architecture
- Measure the impact of implementing a data contract in your organization
- Develop a strategy to determine how data contracts will be used in your organization
Table of contents
- Brief Table of Contents (Not Yet Final)
- 1. Why the Industry Now Needs Data Contracts
- 2. Data Quality Isn’t About Pristine Data
- 3. The Challenges of Scaling Data Infrastructure
  - How Data Development Is Not Like Software Development
  - Core Challenges for Modern Data Engineering Teams
  - Why Data Development Needs a Design Surface
  - The Cost of Large-Scale Refactors
  - The Dangers of Database Migrations
  - The Role of Change Management in Data Quality
  - How Infrastructure Needs Change at Scale
  - Conclusion
  - Additional Resources
  - References
- 4. An Introduction to Data Contracts
- 5. The Data Contract Components: Data Assets and Contract Definition
- About the Authors
Product information
- Title: Data Contracts
- Author(s): Chad Sanderson, Mark Freeman
- Release date: August 2025
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781098157630