Tags » Hadoop

Buổi Offline tháng 4 của Contanalytics team

Các bạn thân mến,

Tuần vừa qua thứ 7 ngày 19/04/2014 tại Trà Sữa 777, địa chỉ 111 Nguyễn Trọng Tuyển, P15, Q.Phú Nhuận. Contanalytics team đã tổ chức buổi Offline tháng 4 với chủ đề: Big Data Overview. 507 more words


Learning Hadoop!!

With referenece to my earlier post on “BigData & Hadoop newbies – Bookmarks!!“. Here are a few additional links for a complete Training plan for Hadoop. 179 more words


Writing a MapReduce Program and using mrUnit to test it

The advantage of using unit testing is essential when writing m/r jobs (in Java or using Streaming); Let’s imagine we want to write a m/r job that output the average length of the words in a text starting with that letter… 1,408 more words


Hadoop API Documentation Miscellaneous Tips

Due to the extremely poor Hadoop API documentation and confusion between the old and the new API, I am going to compile my own Hadoop API Documentation Tips: 221 more words


Hadoop map side join with Distributed Cache Example


Welcome all , today i am posting an example for MAP-SIDE JOIN using mapreduce.

first of all we need to prepare data samples for this map side join… 621 more words

Map Reduce

Setting up Development Environment for Hadoop MapReduce program in Mac OSX


Caution: Only Hadoop Old API (*.mapred.*) works in the local debugging mode, while the new API ( *.mapreduce.* ) does not work. It might be related to the Hadoop Command Line tools problem. 410 more words


Online vs Offline Bigdata solution

Big Data can take both online and offline forms. Online Big Data refers to data that is created, ingested, trans- formed, managed and/or analyzed in real-time to support operational applications and their users. 255 more words

Big Data