
Essential PySpark for Scalable Data Analytics
A beginner's guide to harnessing the power and ease of PySpark 3
Publisher: Packt Publishing Limited
Jun 2024
Table of Contents
- Distributed Computing Primer
- Data Ingestion
- Data Cleansing and Integration
- Real-time Data Analytics
- Scalable Machine Learning with PySpark
- Feature Engineering – Extraction, Transformation, and Selection
- Supervised Machine Learning
- Unsupervised Machine Learning
- Machine Learning Life Cycle Management
- Scaling Out Single-Node Machine Learning Using PySpark
- Data Visualization with PySpark
- Spark SQL Primer
- Integrating External Tools with Spark SQL
- The Data Lakehouse
PDF ISBN: 978-1-80056-309-4
Copyright owner: © 2021 Packt Publishing Limited
Publication date: 2024
Language: English
Pages: 322
