Missing Hive Databases in Cloudera Hue

I created some new databases using hive cli, but those are not listed in Hue. The browser I’m using is Google Chrome.

I logged out and re-login, failed.

I deleted all cached browser files, failed.

I checked cookies, nothing found there.

No API queries are found in the Network tab.

I noticed the ‘Local Storage’ in Chrome’s Developer Tools, many configurations are cached there.
Delete those and re-login.

Missing Hive Databases in Cloudera Hue by @sskaje: https://sskaje.me/2014/11/missing-hive-databases-cloudera-hue/

Fix Alternatives for Cloudera Manager + CDH

Earlier post: Fix Hadoop Conf Alternatives for CDH5

I tried to upgrade Cloudera Manager + CDH 5.0.0 beta 1 and beta 2 from CM+CDH 4 then downgrade and delete, found many alternatives were installed on my small cluster, that made my lately installed CM+CDH 4 and CM+CDH 5 not working well, all because of the dirty uninstallation of CM + CDH 5 beta-s.

To fix these alternatives, I wrote a python script, read default alternative configurations, check all currently installed alternatives and delete broken links, install defaults and bring down priority, so we can use ‘Deploy Client Configuration’ in CM to set up the correct ones.

Repository: https://github.com/sskaje/cm_fix_alternatives

Tested only under centos 6.

Fix Alternatives for Cloudera Manager + CDH by @sskaje: https://sskaje.me/2014/04/fix-alternatives-cloudera-manager-cdh/

Cloudera Manager Drop Database/User on Embedded Postgresql

First, you have to find out where the password locates.
I ran these on which Cloudera Manager locates.

The cloudera-scm is the superuser.
And the first line in that file is the password.

To drop a database, dropdb is the command:

Commands may like:

To drop a user, using dropuser:

Commands may like:

Cloudera Manager Drop Database/User on Embedded Postgresql by @sskaje: https://sskaje.me/2014/04/cloudera-manager-drop-databaseuser-embedded-postgresql/

Incoming search terms:

Install Spark/Shark on CDH 4

CDH 4 is the currently stable version of Cloudera Distribution of Hadoop. http://cloudera.com/
Apache Spark is a fast and general engine for large-scale data processing. http://spark.incubator.apache.org/
Shark is a Hive compatible query engine Based on Spark. http://shark.cs.berkeley.edu/

Cloudera provides a parcel for Apache Spark, official parcel at http://archive.cloudera.com/spark/ and you can get it from my mirror (only if you’re on CentOS/RHEL 6 x86_64) Cloudera Mirror.


CentOS 6.4 x86_64, host names hadoop1-hadoop5.
Cloudera Manager 4.8.1
CDH 4.5.0
Continue reading “Install Spark/Shark on CDH 4” »

Install Spark/Shark on CDH 4 by @sskaje: https://sskaje.me/2014/02/install-spark-shark-cdh-4/

Incoming search terms:

Fix Hadoop Conf Alternatives for CDH5

I’m using CDH5, upgraded failed from CDH4 and then reinstalled directly.
/etc/hadoop/conf is linked to /etc/hadoop/conf/conf.cloudera.mapreduce1/.
Deploy Client Configuration does not make it right.

The way fix it is manually set a new path and remove the old one, like:

But the next time you try Deploy Client Configuration would corrupt it again.

Continue reading “Fix Hadoop Conf Alternatives for CDH5” »

Fix Hadoop Conf Alternatives for CDH5 by @sskaje: https://sskaje.me/2014/02/fix-hadoop-conf-alternatives-cdh5/