Tags » Hadoop

Apache Crunch Toolkit #1: Easily Run Jobs in Apache Crunch

Apache Crunch is an incredibly useful Hadoop tool for abstracting away the boilerplate produced by Java MapReduce jobs. Instead of clunky map() and… 940 more words


What can you Achieve with Big Data??

Dacadmy welcomes all readers to its Big Data and Hadoop articles. This is a journey from data to Big Data technologies. 697 more words


Hadoop 2.6 setup single node

Apache Hadoop 2.6 brings significant improvements over the previous stable 2.X.Y releases, including many improvements in HDFS and MapReduce. This how-to guide will help you install Hadoop 2.6 on CentOS/RHEL 7/6/5 and Ubuntu systems. 890 more words


AWS EMR High Performance Bootstrap Actions

In this post, I describe some EMR bootstrap scripts that are especially helpful in ensuring that Hadoop clusters run well. As a general rule, I use c3.xlarge compute or m3.xlarge spot nodes and have been consistently deploying medium-sized clusters (>30 nodes). 411 more words



There is no proper definition of Big Data; it is a kind of data.

Big Data is a collection of large and complex datasets that have become difficult to process using on-hand database management tools or traditional applications such as MySQL, Oracle, and Teradata. 182 more words


DACADMY - Join Our Battle Against Big Data

HEYAA Fellas, it’s time to start our mission against simple and yet “complex in nature” – DATA.

We’ll be very thankful to our readers if they could first read this and get to know about us. 222 more words