Building Your First ETL Data Platform
Published by O'Reilly Media, Inc.
Batch data extraction, warehousing, transformation, visualization, and analytics using off-the-shelf tools
Course outcomes
- Understand the architecture and components of a modern data platform
- Learn about the tools used for data extraction, warehousing, transformation, visualization, and analytics
- Explore what tools such as Fivetran, Airbyte, Snowflake, BigQuery, dbt, Tableau, and Hex offer
- Learn how to to assess the trade-offs between different tools and their suitability for your use case
Course description
Join expert Sam Bail to explore the foundations of building an ELT data platform from scratch using off-the-shelf tools from the modern data stack. You’ll gain an understanding of the basic architecture of a data platform using the extract-load-transform pattern for extracting data from various data sources and transforming them into analytical insights. You’ll look at common extraction tools such as Fivetran and Airbyte, data warehouses such as Snowflake and BigQuery, the data transformation tool dbt, and data visualization and analytics platforms such as Hex and Tableau, and you’ll understand how to assess the trade-offs between these tools with respect to cost, capability, and ease of use. You’ll also look at issues such as data quality, data cataloging, and “reverse ETL.”
What you’ll learn and how you can apply it
- Understand the basic architecture of an ETL/ELT data platform
- Choose the appropriate tools to use for the parts of a platform
- Design and build a simple analytics pipeline
This live event is for you because...
- You’re a data engineer who’s looking to learn the basics of the modern data stack and how to apply these tools to your business needs.
- You work on a data team that wants to switch to an ETL/ELT architecture.
Prerequisites
Schedule
The time frames are only estimates and may vary according to how the class is progressing.
Part I (80 minutes)
- Presentation: Data platform architecture; data extraction and data warehouse
- Q&A
- Break
Part II (90 minutes)
- Presentation: Data transformation with dbt; data analytics and visualization; data quality, cataloging, and reverse ETL at a glance
Wrap-up and Q&A (10 minutes)
Your Instructor
Sam Bail
Sam Bail is a data professional with a passion for turning high quality data into valuable insights. Sam holds a PhD in Computer Science and has worked for several data-focused startups. In her past role as Engineering Director at Superconductive, she worked on Great Expectations, an open source Python library for data validation and documentation.