Tags » Cloudera

Hortonworks IPO - Why Now?

Last week, many observers were surprised when Hortonworks’ S1 for an initial public offering (IPO) was filed. And there are good reasons to be surprised. 50 more words

Data Management

Cloudera Certified Administrator for Apache Hadoop CDH4 Upgrade Exam (CCAH)

CCA-470 exam full name is Cloudera Certified Administrator for Apache Hadoop CDH4 Upgrade Exam (CCAH), is Cloudera popular certification courses. Obtained Cloudera certificates, can help you get better jobs. 446 more words


What it takes for Big Data

Here I write a quick intro to Big data, why should one be aware of it and what does it mean to one.  Big data simply means a large amount of data. 813 more words

Big Data

Why the Hortonworks IPO could be a bellwether for Hadoop

Hadoop vendor Hortonworks filed for its initial public offering on Monday in a move that could turn out to be a good indicator of how big the market for the open-source big data software will be in the near term. 770 more words

Bayesian Machine Learning on Apache Spark

Markov Chain Monte Carlo methods are another example of useful statistical computation for Big Data that is capably enabled by Apache Spark.

During my internship at Cloudera, I have been working on integrating PyMC with… 71 more words


SAS in Hadoop: An Update

SAS supports several different products that run “inside” Hadoop based on two different in-memory architectures:

(1) The SAS High Performance Analytics suite, originally designed to run in dedicated Teradata and Greenplum appliances, includes five modules: Statistics, Data Mining, Text Mining, Econometrics and Optimization. 723 more words


IBM Big SQL Benchmark vs. Cloudera Impala and Hortonworks Hive/Tez

Earlier this year I blogged about Cloudera’s “benchmarketing” efforts which showed Impala running a 20 query subset of the industry standard TPC-DS benchmark.

Well now its time to blog about IBM Big SQL running all 99 Hadoop-DS queries. 275 more words