Install Spark/Shark on CDH 4

CDH 4 is the currently stable version of Cloudera Distribution of Hadoop. http://cloudera.com/ Apache Spark is a fast and general engine for large-scale data processing. http://spark.incubator.apache.org/ Shark is a Hive compatible query engine Based on Spark. http://shark.cs.berkeley.edu/ Cloudera provides a parcel for Apache Spark, official parcel at http://archive.cloudera.com/spark/ and you can get it from my … Continue reading “Install Spark/Shark on CDH 4”

Cloudera Mirror

Currently available for CentOS/RHEL 6 x86_64. Address: http://cloudera.rst.im/ rsync://cloudera.rst.im/cloudera/ Cloudera Manager Cloudera Manager 5.0.2 Cloudera Manager 5: installer for CM5, yum repo file for CM5 Cloudera Manager 4.8.3 Cloudera Manager 4: installer for CM4, yum repo file for CM4 Cloudera Distribution of Hadoop CDH’s official product page: http://www.cloudera.com/content/cloudera/en/products-and-services/cdh.html Parcel is my favorite choice installing CDH … Continue reading “Cloudera Mirror”