
Data Engineering with Python
Work with massive datasets to design data models and automate data pipelines using Python
Publisher: Packt Publishing Limited
By: Paul Crickard
Sep 2024
Table of Contents
- What is Data Engineering?
- Building Our Data Engineering Infrastructure
- Reading and Writing Files
- Working with Databases
- Cleaning, Transforming, and Enriching Data
- Building a 311 Data Pipeline
- Features of a Production Pipeline
- Version Control Using the NiFi Registry
- Monitoring and Logging Pipelines
- Deploying Your Pipelines
- Building a Production Data Pipeline
- Building a Kafka Cluster
- Streaming Data with Apache Kafka
- Data Processing with Apache Spark
- Real-Time Edge Data with MiNiFi, Kafka, and Spark
- Appendix
PDF ISBN: 978-1-83921-230-7
Copyright owner: © 2020 Packt Publishing Limited
Publication date: 2024
Language: English
Pages: 356
