Tags » Hadoop

Leveraging interactive data mining over Resilent Distributed Datasets (RDD)


We’re in a community that petabytes of transactional data are stored at large clusters, usually partnered with Hadoop. To analyze such data, it’s popular to use interactive data mining tools (e.g. 539 more words

Distributed Systems

Spark RDD's

Spark RDDs are very simple at the same time very important concept in Apache Spark. Most of you might be knowing the full form of RDD, it is… 624 more words

Big Data

What is Hadoop?!


Today,I’m going to throw some light about the basics of Hadoop in layman terms. Let us explore about it straight away!!

What is Hadoop? 645 more words


Watch out, Golden Corral!

Hadoop should be a ‘Data Buffet’ not a ‘Data Lake’. 620 more words


Nuances of dealing with Big Data using Hadoop Ecosystem - 2

Now since we have seen that the size of the data that we are dealing with is ‘Big’, we have to architect solutions to handle the size and still stay close to expectations in terms of processing times. 704 more words


HBase Installation and Configuration

This post covers the HBase installation and important configurations to get first run successful. You can refer HBase – An Introduction for getting the basic ideas about this No SQL framework. 502 more words

Big Data