Tags » Hadoop

Managing a Hadoop Cluster


  1. Introduction
  2. Goals for this Module
  3. Outline
  4. Basic Setup
    1. Java Requirements
    2. Operating System
    3. Downloading and Installing Hadoop
  5. Important Directories
  6. Selecting Machines
  7. Cluster Configurations
    1. Small Clusters: 2-10 Nodes…
  8. 5,841 more words
Big Data

The Hadoop Approach

Hadoop is designed to efficiently process large volumes of information by connecting many commodity computers together to work in parallel. The theoretical 1000-CPU machine described earlier would cost a very large amount of money, far more than 1,000 single-CPU or 250 quad-core machines. 961 more words

Big Data

What is MapReduce? & Role of MapReduce on Bigdata(With Hadoop)

  1. Introduction
  2. Goals for this Module
  3. Outline
  4. Prerequisites
  5. MapReduce Basics
    1. Functional Programming Concepts
    2. List Processing
    3. Mapping Lists
    4. Reducing Lists
    5. Putting them Together in MapReduce
    6. An Example Application: Word Count…
  6. 7,167 more words

Hadoop Live Online Training at GetTrainedForJob.com

Big Data Hadoop is an open source framework which is written in java by apche software foundation. This Hadoop framework is used to write software applications to process vast amount of data. 391 more words


Pig Tutorial

Pig Scalar DataTypes

Complete types
MAP: A map in Pig is a chararray to data element mapping, where that element can be any Pig type, including a complex type. 275 more words


Apache Pig in a blog - Part I

Pig is an Apache open source project in the Hadoop ecosystem, that can be used to write parallelized dataflows on top of Hadoop.

Pig vs Hive… 848 more words


Build Hadoop-2.2.0 Source on windows and Configure in Eclipse

We can now build hadoop source version 2.2.0 on windows and configure it to use in eclipse. Follow the steps mentioned below to configure hadoop source on windows. 558 more words