Tags » Hive

Find the file name corresponding to a record in hive

Every table in hive has two virtual columns. They are

  • INPUT__FILE__NAME
  • BLOCK__OFFSET__INSIDE__FILE

INPUT__FILE__NAME give the name of the file.

BLOCK__OFFSET__INSIDE__FILE is the current global file position. 120 more words

BigData World

Jack Monroe and supporting local independent bookstores

Hello blogworld! The past two weeks have been a bit of a knackering, stressful blur of overtime and launching a complete replatform, but HURRAH, that’s over, and now I can reclaim the extra hours in my day to sit at my desk and actually start writing again. 377 more words

Books

Busy as a Bee

I love bees, don’t you? I think everyone would agree they are one of the cutest insects, and the whole honey-making thing is truly amazing (I love honey, too). 742 more words

Book Review

Configure Apache Hive to Recursively Search Directories for Files

It is common, such as when using Flume to collect log data for example, that files end up inside subdirectories in HDFS.

By default, Hive will only look for files in the root of directory specified, but with a couple of tweaks, it can be configured to look recursively through subdirectories. 241 more words

Big Data

Box keeping

We now are the proud owners of a bee box and have been graced with a swarm to live in it. I believe we will learn a lot, a l o t, from this latest ‘farming’ adventure. 55 more words

Critters

Flowers that were by the Bee hive.

When I went to the hive today there were several of these passions flowers blooming everywhere. They were so pretty I took a picture….



The Bees