Have a personal or library account? Click to login
Data Engineering with Apache Spark, Delta Lake, and Lakehouse Cover

Data Engineering with Apache Spark, Delta Lake, and Lakehouse

Create scalable pipelines that ingest, curate, and aggregate complex data in a timely and secure way

Paid access
|Nov 2021
Product purchase options

Authors

Kukreja Manoj :

Manoj Kukreja is a Principal Architect at Northbay Solutions who specializes in creating complex Data Lakes and Data Analytics Pipelines for large-scale organizations such as banks, insurance companies, universities, and US/Canadian government agencies. Previously, he worked for Pythian, a large managed service provider where he was leading the MySQL and MongoDB DBA group and supporting large-scale data infrastructure for enterprises across the globe. With over 25 years of IT experience, he has delivered Data Lake solutions using all major cloud providers including AWS, Azure, GCP, and Alibaba Cloud. On weekends, he trains groups of aspiring Data Engineers and Data Scientists on Hadoop, Spark, Kafka and Data Analytics on AWS and Azure Cloud.

PDF ISBN: 978-1-80107-432-2
Publisher: Packt Publishing Limited
Copyright owner: © 2021 Packt Publishing Limited
Publication date: 2021
Language: English
Pages: 480