Compressing the output of Sqoop

The output of a Sqoop job can be compressed directly. A Sqoop job is a MapReduce job, so setting the MapReduce output compression codec gives us compressed Sqoop output. 56 more words
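
As a rough sketch of the idea in the excerpt above, the snippet below assembles a Sqoop import command that passes the Hadoop output-compression properties as generic `-D` options. The helper name `build_sqoop_import`, the JDBC URL, the table name and the target directory are all made up for illustration. Whether these two properties are enough on their own may depend on the Sqoop and Hadoop versions in use, so treat this as an assumption rather than a tested recipe.

```python
import shlex


def build_sqoop_import(connect_url, table, target_dir,
                       codec="org.apache.hadoop.io.compress.BZip2Codec"):
    """Return the argv list for a Sqoop import with compressed output.

    Hypothetical helper: it only builds the command, it does not run it.
    Generic Hadoop options (-D ...) must come right after the tool name
    and before the Sqoop-specific arguments.
    """
    return [
        "sqoop", "import",
        "-D", "mapreduce.output.fileoutputformat.compress=true",
        "-D", f"mapreduce.output.fileoutputformat.compress.codec={codec}",
        "--connect", connect_url,
        "--table", table,
        "--target-dir", target_dir,
    ]


# Placeholder values, not taken from the original post.
cmd = build_sqoop_import(
    "jdbc:mysql://db.example.com/sales",
    "orders",
    "/data/orders_bz2",
)
print(shlex.join(cmd))
```

Sqoop also has its own `--compress` and `--compression-codec` arguments, which may be the simpler route when you do not need to set the MapReduce properties yourself.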

Compressing and decompressing strings with BZip2 in .NET C#

There are times when you need to return a large text from a web service. The large text will then need to be handled by the recipient. 215 more words
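
The post itself uses C#, but the round trip is easy to sketch with Python's standard `bz2` module. The function names and the sample text are invented for this illustration, and the Base64 step is an assumption about how the compressed bytes would travel inside a text-based web service response.

```python
import base64
import bz2


def compress_text(text: str) -> str:
    """Compress a string with BZip2 and return it as Base64 text."""
    raw = text.encode("utf-8")
    return base64.b64encode(bz2.compress(raw)).decode("ascii")


def decompress_text(payload: str) -> str:
    """Reverse compress_text: Base64-decode, then BZip2-decompress."""
    return bz2.decompress(base64.b64decode(payload)).decode("utf-8")


# Repetitive sample text, so the compression has something to work with.
large_text = "compress me " * 1000
payload = compress_text(large_text)
restored = decompress_text(payload)

print(len(large_text), len(payload), restored == large_text)
```

Base64 adds roughly a third to the compressed size, so this only pays off when the text is large and compresses well.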


Compressing and decompressing files with BZip2 in .NET C#

BZip2 is yet another data compression algorithm, similar to GZip and Deflate. There’s no native support for BZip2 (de)compression in .NET but there’s a NuGet package provided by… 118 more words
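
For comparison, here is a minimal sketch of the same file round trip in Python, where BZip2 support ships in the standard library. It does not show the NuGet package the post refers to. The helper names and the temporary file names are made up for the example.

```python
import bz2
import shutil
import tempfile
from pathlib import Path


def compress_file(src, dst):
    """Stream src into a BZip2-compressed file at dst."""
    with open(src, "rb") as fin, bz2.open(dst, "wb") as fout:
        shutil.copyfileobj(fin, fout)


def decompress_file(src, dst):
    """Stream the BZip2-compressed file src back out to dst."""
    with bz2.open(src, "rb") as fin, open(dst, "wb") as fout:
        shutil.copyfileobj(fin, fout)


# Round trip inside a throwaway directory.
with tempfile.TemporaryDirectory() as tmp:
    source = Path(tmp) / "input.txt"
    packed = Path(tmp) / "input.txt.bz2"
    unpacked = Path(tmp) / "output.txt"

    original = b"some file content\n" * 500
    source.write_bytes(original)

    compress_file(source, packed)
    decompress_file(packed, unpacked)

    packed_size = packed.stat().st_size
    round_tripped = unpacked.read_bytes()

print(len(original), packed_size, round_tripped == original)
```

Copying through `shutil.copyfileobj` keeps memory use flat, which matters once the files get large.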


Learning R: R can read compressed files!

It might be a tad hard to believe, but I am new … well … somewhat new, to R. I have been using R for a while, but mostly by copying the proper commands from some place, understanding what they do, and then putting them into a program of mine, so that the commands would be generated and then run by that other program. 366 more words



If you have worked with Linux to any degree, then you have noticed these extensions:

.tar .bz2 .tgz .tar.gz .tar.bz2 .gzip etc

There are many others, but today I am talking about .bz2 (bzip2) 96 more words


(BISPL) 2. Unpacking the Files

Welcome to the “There is no standard way of doing anything” world of UNIX® and GNU/Linux. Sometimes you use tar; sometimes untar. Then there’s the bzip family and zip, arc, lha, arj, zoo, rar, and shk. 89 more words


Wikipedia Infrastructure as of 2013

Some facts on Wikipedia:

  1. Wikipedia uses roughly 1,200 servers, see Tactical Monitoring Overview. These servers are located in Ashburn (Virginia), Tampa (Florida), San Francisco, and Amsterdam, see…
  2. 68 more words