Have a personal or library account? Click to login
Spark Cookbook Cover

Spark Cookbook

With over 60 recipes on Spark, covering Spark Core, Spark SQL, Spark Streaming, MLlib, and GraphX libraries this is the perfect Spark book to always have by your side

Paid access
|Sep 2025
Product purchase options

Key Features

    Book Description

    What you will learn

    • Install and configure Apache Spark with various cluster managers
    • Set up development environments
    • Perform interactive queries using Spark SQL
    • Get to grips with realtime streaming analytics using Spark Streaming
    • Master supervised learning and unsupervised learning using MLlib
    • Build a recommendation engine using MLlib
    • Develop a set of common applications or project types, and solutions that solve complex big data problems
    • Use Apache Spark as your single big data compute platform and master its libraries

    Who this book is for

    If you are a data engineer, an application developer, or a data scientist who would like to leverage the power of Apache Spark to get better insights from big data, then this is the book for you.

    Table of Contents

    1. Getting Started with Spark
    2. Developing Applications with Spark
    3. External data sources
    4. Spark SQL
    5. Spark Streaming
    6. Getting started with Machine Learning
    7. Supervised Learning with Mllib
    8. Supervised Learning with MLlib
    9. Unsupervised Learning
    10. Recommender Systems
    11. Graph Processing
    12. Optimization and Performance Tuning
    PDF ISBN: 978-1-78398-707-8
    Publisher: Packt Publishing Limited
    Copyright owner: © 2015 Packt Publishing Limited
    Publication date: 2025
    Language: English
    Pages: 226