Tags » Cloudera

Hadoop distros are starting to diverge

A recent blog posting by Merv Adrian of Gartner, “Hadoop is in the mind of the beholder”  hit the nail on the head. While most commercial Hadoop distributions now include the same basic components (e.g. 176 more words

INFORMS Business Analytics 2014 Conference in Boston, MA – Day 3

On Tuesday, April 1st, 2014, I first attended the reprise presentation of the 2014 Franz Edelman Award Winner, i.e., by the U.S. Centers for Disease Control and Prevention team (see picture below). 436 more words

puppet cloudera module 2.0.2

This is a minor bugfix release of my Puppet module to deploy Cloudera Manager. When I released the module, I had assumed that the testing I did for the C5 beta2 would be 100% valid for C5 GA.  75 more words


Hadoop looks more promising than ever after Intel's big Cloudera deal

Hadoop’s software for storing, cleaning up, and analyzing lots of different kinds of data may not cost you much, but that doesn’t mean it’s second-rate. 449 more words


How to find out a table type in Hive Metastore.

Hi All

As Hive metastore is getting into the center of nervous system for the different type of  SQL engines like Shark and Impala. It getting equally difficult to distinguish type of table created in Hive metastore. 467 more words

Sunil S Ranka

Tech Funding In 2014 Is The Highest It’s Been In A Decade

2014 has been explosive year for tech funding. According to a report published this morning by venture capital database CB Insights, VC-backed deals in the first quarter of 2014 (Q1) are at the highest they’ve been since 2001. 194 more words

Venture Capital

Impala vs pig comparison on AWS EMR

Impala 1.0 was launched back in July last year, and it’s been supported by AWS EMR since last December so I’ve been meaning to have a quick play and also to compare it with a classic map-reduce approach to see the performance difference. 687 more words