
Data Engineering with Scala and Spark
Build streaming and batch pipelines that process massive amounts of data using Scala
Publisher: Packt Publishing Limited
By: Eric Tome, Rupam Bhattacharjee and David Radford
Sep 2024
Table of Contents
- Scala Essentials for Data Engineers
- Environment Setup
- An Introduction to Apache Spark and Its APIs – DataFrame, Dataset, and Spark SQL
- Working with Databases
- Object Stores and Data Lakes
- Understanding Data Transformation
- Data Profiling and Data Quality
- Test-Driven Development, Code Health, and Maintainability
- CI/CD with GitHub
- Data Pipeline Orchestration
- Performance Tuning
- Building Batch Pipelines Using Spark and Scala
- Building Streaming Pipelines Using Spark and Scala
PDF ISBN: 978-1-80461-432-7
Copyright owner: © 2024 Packt Publishing Limited
Publication date: 2024
Language: English
Pages: 300
