Tags » Hadoop

Run a simple map reduce job in hadoop pseudo distributed setup.

In this blog I will describe how you can run a simple MapReduce job on a single-node Hadoop cluster. For this I am going to use a WordCount example, which reads text files and counts how often words occur.

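As a rough illustration of what a WordCount job does, here is a minimal sketch in plain Python that imitates the map, shuffle, and reduce phases in memory. It is not the post's own code and does not use Hadoop; the function names (`map_words`, `shuffle`, `reduce_counts`, `word_count`) are made up for this example, and lower-casing the words is an assumption.

```python
from collections import defaultdict


def map_words(line):
    """Map phase: emit a (word, 1) pair for every whitespace-separated token."""
    for word in line.split():
        # Lower-casing is an assumption made for this sketch.
        yield word.lower(), 1


def shuffle(pairs):
    """Shuffle phase: group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_counts(word, counts):
    """Reduce phase: sum the ones emitted for a single word."""
    return word, sum(counts)


def word_count(lines):
    """Run the three phases in memory over an iterable of text lines."""
    pairs = (pair for line in lines for pair in map_words(line))
    return dict(reduce_counts(w, c) for w, c in shuffle(pairs).items())


if __name__ == "__main__":
    print(word_count(["hello hadoop", "hello world"]))
```

In a real Hadoop job the framework performs the shuffle between mappers and reducers; here it is a dictionary, which is enough to show the data flow.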

Install & Configure Apache Hadoop 2.x.x On Ubuntu (Single Node Cluster or Pseudo Distributed Setup)

“In this blog I will describe the steps for setting up a single-node Hadoop cluster backed by the Hadoop Distributed File System, running on Ubuntu Linux”


Reduce side join in hadoop map reduce framework

In this blog I am going to show how you can use reduce-side joins in the MapReduce framework of Hadoop.

Why do we need to join data?

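To give a rough idea of a reduce-side join, here is a minimal in-memory sketch in plain Python. It is not the post's own code: the names (`map_tagged`, `reduce_join`, `reduce_side_join`), the `"left"`/`"right"` tags, and the assumption that the join key is the first field of every record are all made up for this example. Each record is tagged with its source in the map step, grouped by key, and combined in the reduce step (an inner join).

```python
from collections import defaultdict


def map_tagged(source, records):
    """Map phase: emit (join_key, (source_tag, record)) for each record.

    Assumes the join key is the first field of every record.
    """
    for record in records:
        yield record[0], (source, record)


def reduce_join(tagged_values):
    """Reduce phase: pair every left record with every right record for one key."""
    left = [rec for tag, rec in tagged_values if tag == "left"]
    right = [rec for tag, rec in tagged_values if tag == "right"]
    for l_rec in left:
        for r_rec in right:
            # Drop the duplicate key field from the right-hand record.
            yield l_rec + r_rec[1:]


def reduce_side_join(left_records, right_records):
    """Inner join of two lists of tuples on their first field."""
    grouped = defaultdict(list)  # stands in for Hadoop's shuffle
    for key, value in map_tagged("left", left_records):
        grouped[key].append(value)
    for key, value in map_tagged("right", right_records):
        grouped[key].append(value)

    joined = []
    for key in sorted(grouped):
        joined.extend(reduce_join(grouped[key]))
    return joined


if __name__ == "__main__":
    customers = [("1", "Ann"), ("2", "Bob")]
    orders = [("1", "book"), ("1", "pen"), ("3", "cup")]
    print(reduce_side_join(customers, orders))
```

Keys that appear in only one input produce no output here; an outer join would need to emit those records with placeholder fields.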

Write a map reduce job in hadoop

In this tutorial I will describe how you can write your own simple MapReduce job and run it on a single-node Hadoop cluster. We are going to create our own MapReduce job to solve the following exercise…




How it is useful: this lets you bring Hortonworks stream data from HCatalog into MS Excel for visualization.

Step 1: Get the IP address of the VM box…


HortonWorks Sandbox 2.1 Twitter Sentiment example: Failed: execution error, return code from org.apache.hadoop.hive hortonworks

First, the instructions at http://hortonworks.com/hadoop-tutorial/how-to-refine-and-visualize-sentiment-data/ have a small error: you need to copy json-serde-1.1.6-SNAPSHOT-jar-with-dependencies.jar along with hiveddl.sql…


The Hortonworks IPO, not adding up for me. #BigData #Hadoop

There’s Money In Those Elephants….

I have to admit I got a little excited when I saw a small piece in the launchticker.com newsletter about…