Table of Contents
- Introduction to Data Ingestion
- Principles of Data Access – Accessing Your Data
- Data Discovery – Understanding Our Data Before Ingesting It
- Reading CSV and JSON Files and Solving Problems
- Ingesting Data from Structured and Unstructured Databases
- Using PySpark with Defined and Non-Defined Schemas
- Ingesting Analytical Data
- Designing Monitored Data Workflows
- Putting Everything Together with Airflow
- Logging and Monitoring Your Data Ingest in Airflow
- Automating Your Data Ingestion Pipelines
- Using Data Observability for Debugging, Error Handling, and Preventing Downtime

