Book description
Discover all the essential design and architectural patterns in one place to help you rapidly build and deploy your modern data platform using AWS services
Key Features
- Learn to build modern data platforms on AWS using data lakes and purpose-built data services
- Uncover methods of applying security and governance across your data platform built on AWS
- Find out how to operationalize and optimize your data platform on AWS
- Purchase of the print or Kindle book includes a free PDF eBook
Book Description
Many IT leaders and professionals are adept at extracting data from a particular type of database and deriving value from it. However, designing and implementing an enterprise-wide holistic data platform with purpose-built data services, all seamlessly working in tandem with the least amount of manual intervention, still poses a challenge.
This book will help you explore end-to-end solutions to common data, analytics, and AI/ML use cases by leveraging AWS services. The chapters systematically take you through all the building blocks of a modern data platform, including data lakes, data warehouses, data ingestion patterns, data consumption patterns, data governance, and AI/ML patterns. Using real-world use cases, each chapter highlights the features and functionalities of numerous AWS services to enable you to create a scalable, flexible, performant, and cost-effective modern data platform.
By the end of this book, you’ll be equipped with all the necessary architectural patterns and be able to apply this knowledge to efficiently build a modern data platform for your organization using AWS services.
What you will learn
- Familiarize yourself with the building blocks of modern data architecture on AWS
- Discover how to create an end-to-end data platform on AWS
- Design data architectures for your own use cases using AWS services
- Ingest data from disparate sources into target data stores on AWS
- Build data pipelines, data sharing mechanisms, and data consumption patterns using AWS services
- Find out how to implement data governance using AWS services
Who this book is for
This book is for data architects, data engineers, and professionals creating data platforms. The book's use case–driven approach helps you conceptualize possible solutions to specific use cases, while also providing you with design patterns to build data platforms for any organization. It's beneficial for technical leaders and decision makers to understand their organization's data architecture and how each platform component serves business needs. A basic understanding of data & analytics architectures and systems is desirable along with beginner’s level understanding of AWS Cloud.
Table of contents
- Modern Data Architecture on AWS
- Contributors
- About the author
- About the reviewers
- Preface
- Part 1: Foundational Data Lake
- Prologue: The Data and Analytics Journey So Far
- Chapter 1: Modern Data Architecture on AWS
- Chapter 2: Scalable Data Lakes
- Part 2: Purpose-Built Services And Unified Data Access
- Chapter 3: Batch Data Ingestion
- Chapter 4: Streaming Data Ingestion
-
Chapter 5: Data Processing
-
Challenges with data processing platforms
- Challenge 1 – Fixed costs for an on-premises data processing platform
- Challenge 2 – Compute is always on, even when not required
- Challenge 3 – Tight coupling between the storage and compute layers
- Challenge 4 – Scalability issues
- Challenge 5 – Operational issues
- Challenge 6 – Limited capabilities
- Challenge 7 – Third-party vendor lock-in
- Data processing using Amazon EMR
- Data processing using AWS Glue
- Data processing using AWS Glue DataBrew
- Summary
- References
-
Challenges with data processing platforms
- Chapter 6: Interactive Analytics
- Chapter 7: Data Warehousing
- Chapter 8: Data Sharing
- Chapter 9: Data Federation
- Chapter 10: Predictive Analytics
- Chapter 11: Generative AI
- Chapter 12: Operational Analytics
- Chapter 13: Business Intelligence
- Part 3: Govern, Scale, Optimize And Operationalize
-
Chapter 14: Data Governance
- What is data governance?
- Data governance on AWS
- Data governance using Amazon DataZone
- Fine-grained access control using AWS Lake Formation
- Improving data quality using Glue Data Quality
- Sensitive data discovery with Amazon Macie
- Data collaborations with partners using AWS Clean Rooms
- Data resolution with AWS Entity Resolution
- Summary
- References
- Chapter 15: Data Mesh
- Chapter 16: Performant and Cost-Effective Data Platform
- Chapter 17: Automate, Operationalize, and Monetize
- Index
- Other Books You May Enjoy
Product information
- Title: Modern Data Architecture on AWS
- Author(s):
- Release date: August 2023
- Publisher(s): Packt Publishing
- ISBN: 9781801813396
You might also like
book
Serverless Development on AWS
The adoption of serverless is on the rise, but until now, little guidance has been available …
book
AWS for Solutions Architects
Apply cloud design patterns to overcome real-world challenges by building scalable, secure, highly available, and cost-effective …
video
AWS Certified Solutions Architect Associate (SAA-C03)
8+ Hours of Video Instruction 8 Hours of Video Instruction and Test-Taking Strategies for the Topics …
book
Generative AI on AWS
Companies today are moving rapidly to integrate generative AI into their products and services. But there's …