Book description
Poor data quality can cause major problems for data teams, from breaking revenue-generating data pipelines to losing the trust of data consumers. Despite the importance of data quality, many data teams still struggle to avoid these issues—especially when their data is sourced from upstream workflows outside of their control. The solution: data contracts. Data contracts enable high-quality, well-governed data assets by documenting expectations of the data, establishing ownership of data assets, and then automatically enforcing these constraints within the CI/CD workflow.
This practical book introduces data contract architecture with a clear definition of data contracts, explains why the data industry needs them, and shares real-world use cases of data contracts in production. In addition, you'll learn how to implement components of the data contract architecture and understand how they're used in the data lifecycle. Finally, you'll build a case for implementing data contracts in your organization.
Authors Chad Sanderson and Mark Freeman will help you:
- Explore real-world applications of data contracts within the industry
- Understand how to apply each component of this architecture, such as CI/CD, monitoring, version control data, and more
- Learn how to implement data contracts using open source tools
- Examine ways to resolve data quality issues using data contract architecture
- Measure the impact of implementing a data contract in your organization
- Develop a strategy to determine how data contracts will be used in your organization
Publisher resources
Table of contents
- Brief Table of Contents (Not Yet Final)
- 1. Why the Industry Now Needs Data Contracts
- 2. Data Quality Isn’t About Pristine Data
-
3. The Challenges of Scaling Data Infrastructure
- How Data Development Is Not Like Software Development
- Core Challenges for Modern Data Engineering Teams
- Why Data Development Needs a Design Surface
- The Cost of Large-Scale Refactors
- The Dangers of Database Migrations
- The Role of Change Management in Data Quality
- How Infrastructure Needs Change at Scale
- Conclusion
- Additional Resources
- References
- 4. An Introduction to Data Contracts
- 5. The Data Contract Components: Data Assets and Contract Definition
- About the Authors
Product information
- Title: Data Contracts
- Author(s):
- Release date: August 2025
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781098157630
You might also like
book
Driving Data Quality with Data Contracts
Everything you need to know to apply data contracts and build a truly data-driven organization that …
book
Data Management at Scale
As data management and integration continue to evolve rapidly, storing all your data in one place, …
book
Financial Data Engineering
Today, investment in financial technology and digital transformation is reshaping the financial landscape and generating many …
book
The Self-Service Data Roadmap
Data-driven insights are a key competitive advantage for any industry today, but deriving insights from raw …