Part 1: Fundamentals of Data Ingestion

In this part, you will be introduced to the fundamentals of data ingestion and data engineering, passing through the basic definition of an ingestion pipeline, the common types of data sources, and the technologies involved.

This part has the following chapters:

  • Chapter 1, Introduction to Data Ingestion
  • Chapter 2, Principals of Data Access – Accessing Your Data
  • Chapter 3, Data Discovery – Understanding Our Data Before Ingesting It
  • Chapter 4, Reading CSV and JSON Files and Solving Problems
  • Chapter 5, Ingesting Data from Structured and Unstructured Databases
  • Chapter 6, Using PySpark with Defined and Non-Defined Schemas
  • Chapter 7, Ingesting Analytical Data

Get Data Ingestion with Python Cookbook now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.