Tags » Hadoop

Incremental append and lastmodified in SQOOP.

Unofficially stating, SQOOP stands for SQl + hadOOP, which can be thought as Sql like interface to communicate with Hadoop.

Many a times in practical scenarios, if data stored on databses such as Oracle, MySql, DB2, etc becomes huge to analyse and get deep insight then these data need to be imported to Hadoop for analysis. 323 more words

Hadoop

MapReduce Execution in Hadoop

In this article we have tried to summaries,  how a MapReduce program executes in Hadoop environment.

MapReduce 1 Execution Sequence:

MapReduce execution starts with below command. 191 more words

Hadoop

Remember your daft ideas, well they're not that daft after all. (@foldingathome @WiredUK)

My office is littered with notebooks, mainly Moleskines as I’m a colossal hipster nerd (sans beard) and I only write in them with Uniball Eye Micro… 479 more words

Data And Statistics

Reflections beyond Big Data Week - #bigdata #bdwbelfast

I’ve had some time to reflect on a few things recently, one of which was the Big Data Week Belfast panel.

It was a delight to sit with my friend Tom Gray, CTO of Kainos, Adele Marshall director of research at Centre for Statistical Science at Queens University and Padraic Sheerin of the Prudential who had the unenviable task of keeping us all in order, he did a good job. 592 more words

Hadoop Vs Relational Database Management System

Hadoop or RDBMS? There are a lot of different opinions out there among these two technologies. Few people think Hadoop is a replacement of RDBMS, few say Hadoop is the future, and few thinks Hadoop have just got hype. 932 more words

Hadoop

HDFS Architecture

The Hadoop Distributed File System (HDFS) is a highly fault tolerant file system designed and optimized to be deployed on a distributed infrastructure established with a bunch commodity hardware. 352 more words

Hadoop

Discovery of Hadoop

Back in the early 2000’s Dough Cutting was attempting to build an Open Source Search engine called¬†Nutch. They were facing trouble managing their distributed system even when they were running on very few computers. 386 more words

Hadoop