Tags » Cloudera

Tutorial: Install CDH 5 for testing on one machine

This is a tutorial after my own experience to install CDH 5.4 via the Cloudera Manager on one machine only for test purposes. This is based on a Mint machine (based on Ubuntu/Debian). 807 more words


Trillium Software And Cloudera Collaborate To Deliver Data Quality Solutions For Hadoop

Cloudera and Trillium Software recently announced a collaboration whereby the Trillium Big Data solution is certified for Cloudera’s Hadoop distribution. As a result of the partnership, Cloudera customers can take advantage of Trillium’s data quality solutions to profile, cleanse, de-duplicate and enrich Hadoop-based data. 153 more words

Big Data

How to install HDFS and Hadoop using CDH3

Cloudera’s Distribution (CDH) provides streamlined installation of Apache Hadoop via Cloudera Manager. Besides Apache Hadoop, CDH also allows installation of other components such as Hive, Pig, HBase, ZooKeeper, etc. 20 more words

Hadoop adoption hurt by hype. Hey, this stuff is hard!

Even people who are not data scientists know that Hadoop is a big deal, even if they don’t know exactly what it is. Hadoop in many people’s eyes equals big data. 449 more words


MarkLogic snags $102 million in new funding to push its database abroad

MarkLogic now has $102 million in fresh Series F cash, bringing total funding to date to $173 million and—according to a spokeswoman—a $1 billion pre-money valuation. 205 more words


14 Things you didn't know about Cloudera

It’s fair that we can’t really start this without a little introduction to Cloudera. They were really bumped up our radar this year after topping $100 million in revenue in 2015. 768 more words


Hadoop Nedir? Mapreduce Nedir?

Hadoop’un başlangıcı 1990ların sonu 2000lerin başında Google çalışmalarına gidiyor. Google 2003 senesinde Google File Sistemini çıkarıyor. 2004 yılında Map Reduce ortaya çıkıyor.

Ana prensipleri Developerlar’ın network programlamasında çok uğraşmamaları, Developerların nodeların birbiri ile konuşmaları için minimum uğraşmaları,Nodeların birbiri ile minimum haberleşmeleri, Datanın kopyalanması sayesinde hem kullanılabilirlik hemde ulaşılabilirlik artacaktır. 479 more words

Big Data