Book description
Data-driven insights are a key competitive advantage for any industry today, but deriving insights from raw data can still take days or weeks. Most organizations can’t scale data science teams fast enough to keep up with the growing amounts of data to transform. What’s the answer? Self-service data.
With this practical book, data engineers, data scientists, and team managers will learn how to build a self-service data science platform that helps anyone in your organization extract insights from data. Sandeep Uttamchandani provides a scorecard to track and address bottlenecks that slow down time to insight across data discovery, transformation, processing, and production. This book bridges the gap between data scientists bottlenecked by engineering realities and data engineers unclear about ways to make self-service work.
- Build a self-service portal to support data discovery, quality, lineage, and governance
- Select the best approach for each self-service capability using open source cloud technologies
- Tailor self-service for the people, processes, and technology maturity of your data platform
- Implement capabilities to democratize data and reduce time to insight
- Scale your self-service portal to support a large number of users within your organization
Table of contents
- Preface
- 1. Introduction
- I. Self-Service Data Discovery
- 2. Metadata Catalog Service
- 3. Search Service
- 4. Feature Store Service
- 5. Data Movement Service
- 6. Clickstream Tracking Service
- II. Self-Service Data Prep
- 7. Data Lake Management Service
- 8. Data Wrangling Service
- 9. Data Rights Governance Service
- III. Self-Service Build
- 10. Data Virtualization Service
- 11. Data Transformation Service
- 12. Model Training Service
- 13. Continuous Integration Service
- 14. A/B Testing Service
- IV. Self-Service Operationalize
- 15. Query Optimization Service
- 16. Pipeline Orchestration Service
- 17. Model Deploy Service
- 18. Quality Observability Service
- 19. Cost Management Service
- Index
- About the Author
Product information
- Title: The Self-Service Data Roadmap
- Author(s): Sandeep Uttamchandani
- Release date: September 2020
- Publisher(s): O'Reilly Media, Inc.
- ISBN: 9781492075257