Prerequisite: At least 12GB+ RAM (i.e. 4GB+ for operating system & 8GB+ for Cloudera), although 16 GB+ is preferred. 80GB Hard Disk. Cloudera runs on CentOS, which is the community edition of the Linux. Windows system must support 64-bit.
This post is obsolete with the newer version of Cloudera.…
This extends Installing & getting started with Cloudera QuickStart on VMWare for windows to upgrade the Java version from 1.7.x to 1.8.x. Firstly, check if your Cloudera version supports Java 1.8. This example is running on Cloudera 5.12.
This post is obsolete with the new Cloudera version.
Step 1: Power on the VMWare and login to “Cloudera Manager”.…
Problem: When you run a Spark job via “spark-submit” command on a “YARN” cluster as shown below in a terminal,
-bash-4.1$ spark-submit --class com.mytutorial.SparkSimpleRdd --master yarn --deploy-mode cluster /home/cloudera/projects/simple-spark/target/simple-spark-1.0-SNAPSHOT.
Step 1: A number of special users are created by default when installing and using CDH and Cloudera Manager. For example
Unix user id: hdfs
groups: hdfs hadoop
Unix user id: spark
Unix user id: hive
and so on.
The Cloudera manager processes will use the Unix Id of “cloudera-scm” and group of “cloudera-scm”.…