Tags » Hadoop

Factors of Interest in tuning shuffle phase to maximize the performance of MapReduce

“The shuffle is the heart of MapReduce and is where the “magic” happens.The shuffle is an area of the codebase where refinements and improvements are continually being made.” 400 more words
Map- Reduce

Big Data Analytics Presentation for SQL Saturday Orlando

Thanks to all for joining my session on Big Data Analytics at Seminole State College in Sanford, FL for the SQL Saturday event. I’ve uploaded my slides to… 6 more words

Big Data Analytics

Hadoop Offerings

The table below provides an overview of a selection of the players in the market.

Vendor Product/Offering Value-Add ontop of Hadoop Apache Hadoop (Core) Committers or PMC Uses Hadoop Internally? 581 more words


Managing a Hadoop Cluster


  1. Introduction
  2. Goals for this Module
  3. Outline
  4. Basic Setup
    1. Java Requirements
    2. Operating System
    3. Downloading and Installing Hadoop
  5. Important Directories
  6. Selecting Machines
  7. Cluster Configurations
    1. Small Clusters: 2-10 Nodes…
  8. 5,841 more words
Big Data