Learning Hadoop 2

Design and implement data processing, lifecycle management, and analytic workflows with the cutting-edge toolbox of Hadoop 2

Publisher:Packt Publishing Limited

By: GABRIELE MODENA

Paid access

|Jan 2025

E-Book €34.99Institutions €124.95

Description

Key Features

Book Description

If you are a system or application developer interested in learning how to solve practical problems using the Hadoop framework, then this book is ideal for you. You are expected to be familiar with the Unix/Linux command-line interface and have some experience with the Java programming language. Familiarity with Hadoop would be a plus.

What you will learn

Write distributed applications using the MapReduce framework
Go beyond MapReduce and process data in real time with Samza and iteratively with Spark
Familiarize yourself with data mining approaches that work with very large datasets
Prototype applications on a VM and deploy them to a local cluster or to a cloud infrastructure (Amazon Web Services)
Conduct batch and real time data analysis using SQLlike tools
Build data processing flows using Apache Pig and see how it enables the easy incorporation of custom functionality
Define and orchestrate complex workflows and pipelines with Apache Oozie
Manage your data lifecycle and changes over time

Who this book is for

If you are a system or application developer interested in learning how to solve practical problems using the Hadoop framework, then this book is ideal for you. You are expected to be familiar with the Unix/Linux command-line interface and have some experience with the Java programming language. Familiarity with Hadoop would be a plus.

Learning Hadoop 2

Key Features

Book Description

What you will learn

Who this book is for

Table of Contents

People also read

Publications carousel

Hadoop: Data Processing and Modelling

Hadoop: Data Processing and Modelling

Apache Hadoop 3 Quick Start Guide

Mastering Hadoop 3

Hadoop 2.x Administration Cookbook

Big Data and Hadoop - 2nd Edition

Hadoop实际解决方案手册

面向MapReduce的Hadoop优化

Deep Learning with Hadoop

Paradigm

My account