Wednesday 4 December 2013

Installing cloudera quick start VM on VM VirtualBox Manager - tips and troubleshooting for Windows 64bit


Installing cloudera quick start VM on VM VirtualBox Manager


While i was doing installation of cloudera quick start VM on VM virtualBox manager, i did some stupid mistakes which made me spend time in fixing. so sharing some of these so that others can avoid the same.

  1. While downloading please make sure you download the software meant for your Virtual Machine like VirtualBox, KVM etc.. I was having oracle VM Virtual box but first time downloaded KVM and got errors. Correct one for oracle VM VirtualBox was VirtualBox version.

  1. once you connect to terminal in centOS in cloudera VM (after installing it) it logs as cloudera user and this user does not have all rights/privs on all directories of the VM. You might need to do sudo -su hive or hdfs or root - based on your needs to execute some commands.
  2. By default cloudera comes with cloudera manager and hue - which are good interface to administer the system and to see details of services, jobs, run sql in hive etc..you simply need to login as cloudera/cloudera (in version i downloaded cloudera-quickstart-vm-4.4.0-1-virtualbox).
  3.  Hive metastore - for a while i struggled to find whether installation i using derby hive metastore or mysql or others...it turned out that it was using postgres db for metastore. if you serach for hive*.xml you will see more than one version for same, so it was little confusing which is the version being used. 
  4. So to find details about metastore DB - i did following
    1. go to cloudera admin console 
    2. navigate to services/hive/(service-wide option in left panel) 
    3. --see hive metastore database - here you will see which DB it uses...(postgres in my case) and also you can see db name  and dbport number (7432 in my case)
  5. postgressql db password - to get password for hive metastore DB - you can do following
  6. username to be used is  cloudera-scm
    password is available in file  /var/lib/cloudera-scm-server-db/data/generated_password.txt
     
  7. now to connect to hive metastore
    launch pgadmin (you have to download GUI version if not available)
    server - localhost
    port - 7432 (noted in previous steps)
    databasename - as noted in previous steps

No comments:

Post a Comment