Tags » Hadoop

Open Source Cloud Formation with Minotaur for Mesos, Kafka and Hadoop

Today I am happy to announce “Minotaur” which is our Open Source AWS based infrastructure for managing big data open source projects including (but not limited too): … 1,038 more words

Open Source Projects

Ideas and goals behind the Go Kafka Client

I think a bunch of folks have heard already that B.D.O.S.S. was working on a new Apache Kafka Client For Go. Go Kafka Client was open sourced last Friday. 755 more words

Open Source Projects

Run a simple map reduce job in hadoop pseudo distributed setup.

“In this blog I will describe, how you can run a simple map reduce job in a single-node Hadoop cluster. for this i am going to use a WordCountexample which reads text files and counts how often words occur.

685 more words
Map-reduce

Install & Configure Apache Hadoop 2.x.x On Ubuntu (Single Node Cluster or Pseudo Distributed Setup)

“In this blog I will describe the steps for setting up a single-node Hadoop cluster backed by the Hadoop Distributed File System, running on Ubuntu Linux”

1,452 more words
Hadoop

Reduce side join in hadoop map reduce framework

In this blog I am going to show, how you can us the Reduce side joins in map reduce framework of hadoop.

Why we need to do data join ??

1,248 more words
Map-reduce

Write a map reduce job in hadoop

In this tutorial I will describe how you can write your own simple map reduce job and run in a single-node Hadoop cluster. for this we are going to create our own map reduce job so we ‘ll solve the following exercise… 1,064 more words

Map-reduce

How-to-install-and-configure-the-hortonworks-odbc-driver-on-windows-8

How-to-install-and-configure-the-hortonworks-odbc-driver-on-windows-8

How it is useful: This is useful to connect hortonworks stream data from HCatalog to bring to MS excel to visualize

Step1: Get hold on ip address of the VM box… 140 more words

Hadoop