Tags » Hadoop

Creating multiple Spark sessions in a Kerberos-enabled cluster throws error

ISSUE: Creating multiple Spark sessions in a Kerberos-enabled cluster throws the error below:

Py4JJavaError: An error occurred while calling None.org.apache.spark.api.java.JavaSparkContext.

: org.apache.hadoop.ipc.RemoteException(java.io.IOException): Delegation Token can be issued only with kerberos or web authentication… 53 more words


Big Data for Beginners in Chennai

Hadoop is a free, Java-based programming framework that supports the processing of large data sets in a distributed computing environment. It is part of the Apache project sponsored by the Apache Software… 264 more words

Big Data

Performance comparison of different file formats and storage engines in the Apache Hadoop ecosystem


This post presents a performance comparison of a few popular data formats and storage engines available in the Apache Hadoop ecosystem: Apache Avro, Apache Parquet, Apache HBase… 2,274 more words


Big Data: What is Hadoop?

In my last article I introduced you to “Big Data”. With that in mind, you cannot keep yourself away from Hadoop. So what is Hadoop then? 698 more words

Big Data

Oracle Free Cloud Trial - Part 3: Hadoop

This is Part 3 of my series of four blog articles recording my experiences trying out the Oracle Cloud using the 30-day / $300 Free Trial offer. 1,387 more words


Hadoop MapReduce: Hottest / Coolest Year

# Download dataset.zip file (attached with this post)

# It contains NCDC weather data from year 1901 to year 1920.
# Copy and extract dataset.zip in your home folder… 877 more words

Hadoop Delegation Tokens Explained

Apache Hadoop’s security was designed and implemented around 2009, and has been stabilizing since then. However, due to a lack of documentation around this area, it’s hard to understand or debug when problems arise. 3,629 more words