Table of Contents
- Scala Essentials for Data Engineers
- Environment Setup
- An Introduction to Apache Spark and Its APIs – DataFrame, Dataset, and Spark SQL
- Working with Databases
- Object Stores and Data Lakes
- Understanding Data Transformation
- Data Profiling and Data Quality
- Test-Driven Development, Code Health, and Maintainability
- CI/CD with GitHub
- Data Pipeline Orchestration
- Performance Tuning
- Building Batch Pipelines Using Spark and Scala
- Building Streaming Pipelines Using Spark and Scala

