2
Scalable Data Lakes
In this chapter, we will look at how organizations can build a data platform foundation by creating data lakes on AWS.
We will cover the following main topics:
- Why choose Amazon S3 as a data lake store?
- Business scenario setup
- Data lake layers
- Data lake patterns
- Data catalogs
- Transactional data lakes
- Putting it all together
Why choose Amazon S3 as a data lake store?
Before we dive into data and analytics use cases and explore how to design data lakes on AWS, it is important to understand why Amazon Simple Storage Service (Amazon S3) is the preferred choice for building a data lake and why it serves as the storage layer that holds all kinds of data in a centralized location.
If you recall from the discussions ...