1. What is Apache Spark?
- A open source and powerful data processing engine.
- Complement (or even replace) its pioneer counterpart – Hadoop in the future due to much better performance. 1,063 more words
Interesting huh, it seems that Spark has many promises in foreseeable future.
I am sure some of us wanted to experience Hadoop in a true cluster, not just limiting to a pseudo cluster (aka, single node). I played a bit with other distributions in the past, including PHD and CDH. 322 more words