The ETL blueprint

AWS Glue allows you to implement an extract, transform, and load process from start to finish. This pattern is a common methodology for processing large amounts of data. In the coming sections, we're going to step through an example using sample weather data.

To get an understanding about what we are about to build from a high level, study the following diagram:

ETL process showing components and high-level data flows

As you can see in the blueprint diagram, our data source will be a filesystem that holds the weather data files to be processed. In this case, the filesystem is your local machine.

You can download the sample ...

Get Learn AWS Serverless Computing now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.