Tags » Cloudera

Amazing Data Warehousing with Hadoop and Big Data

Many thanks for reading, and don’t forget, please join The Big Data Contrarians.

Some time back, Bill Inmon, the father of Data Warehousing, took the Hadoop vendor Cloudera to task for putting out some confusing advertising. 741 more words

Big Data

Hadoop : Cloudera, Hortonworks ou MapR ?

Pour commencer à utiliser Hadoop, le plus simple est d’installer une distribution ayant déjà les différentes briques fonctionnant ensemble.

Les trois principales sont Cloudera, Hortonworks ou MapR, chacune ayant ses spécificités. 126 more words

Hadoop

Cloudera Certified Developer for Apache Hadoop (CCDH)

Happy to share the earning of CCDH certification  :-)

Verification URL:  http://certification.cloudera.com/verify   (with License # 100-013-285)

Loads of conceptual as well as programming questions that include multiple choice questions as well. 54 more words

Technical

Installing Cloudera Manager in an existing hadoop cluster

Cloudera Manager is an Infrastructure management and monitoring tool provided by cloudera. This has now became a very excellent tool to manage bigdata infrastructure. The pain of administrators has been reduced by 80% with this cloudera manager. 204 more words

BigData World

Gerrit Code Review and Jenkins Continuous Delivery Pipeline on BigData

Gerrit at the Jenkins User Conference 2015 – London

For the very first time, CloudBees organised a full User Conference in London and we have been… 1,438 more words

Code-review

Cloudera Director for a Quick Hadoop Cluster on AWS !

Steps to deploy a cluster using the Cloudera Director on AWS.

Step 0:

Setting up the AWS environment, -> create a VPC and subnet within it then a security group associated…

416 more words
Big Data & Realtime

Building a Faster ETL Pipeline with Flume, Kafka, and Hive

At WordPress.com we process a lot of events including some some events that are batched and sent asynchronously sometimes days later. But when querying this data we are likely to care more about when the events occurred rather then when it was sent to our servers. 2,000 more words