Prerequisites: At least 12 GB of RAM (i.e. 4 GB for the host operating system plus 8 GB for Cloudera), although 16 GB or more is preferred. 80 GB of hard disk space. The Cloudera QuickStart VM runs on CentOS, which is a community-supported Linux distribution. The Windows host must be a 64-bit system.
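The prerequisites above can be checked with a short script before downloading anything. This is only a sketch: the function names are made up for illustration, the RAM figure is entered by hand (e.g. read from “msinfo32.exe”), and it assumes the 80 GB requirement means free disk space.

```python
import platform
import shutil

MIN_RAM_GB = 12    # 4 GB for the host OS + 8 GB for the Cloudera VM
MIN_DISK_GB = 80   # assumption: interpreted here as free disk space


def is_64_bit(machine):
    """Return True for common 64-bit architecture names."""
    return machine.lower() in {"amd64", "x86_64", "arm64", "aarch64"}


def check_prerequisites(ram_gb, free_disk_gb, machine):
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    if ram_gb < MIN_RAM_GB:
        problems.append(f"RAM: {ram_gb} GB found, {MIN_RAM_GB} GB needed")
    if free_disk_gb < MIN_DISK_GB:
        problems.append(f"Disk: {free_disk_gb:.0f} GB free, {MIN_DISK_GB} GB needed")
    if not is_64_bit(machine):
        problems.append(f"Architecture '{machine}' does not look 64-bit")
    return problems


if __name__ == "__main__":
    ram_gb = 16  # hypothetical value; replace with the figure from msinfo32
    free_gb = shutil.disk_usage(".").free / 1024**3
    issues = check_prerequisites(ram_gb, free_gb, platform.machine())
    print("OK" if not issues else "\n".join(issues))
```

The RAM value is passed in rather than detected because the Python standard library has no portable call for total physical memory.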
Install VMWare for Windows
Step 1: Go to https://my.vmware.com/web/vmware/free, select VMWare Workstation Player, and download the Windows version. The URL used for this download is “https://my.vmware.com/web/vmware/free#desktop_end_user_computing/vmware_workstation_player/14_0”.
Step 2: Install VMWare by double-clicking the downloaded “.exe” file. Restart Windows after installing.
Install Cloudera for VMWare
Step 1: Download the Cloudera QuickStart VM for VMWare (e.g. cloudera-quickstart-vm-5.12.0-0-vmware) from “https://www.cloudera.com/downloads/quickstart_vms/5-12.html”.
Fill in the form, mark the purpose as self-learning, and download the “zip” file. The download will take some time.
Step 2: Extract the downloaded “cloudera-quickstart-vm-5.12.0-0-vmware.zip”.
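If you prefer to script the extraction instead of using the Windows Explorer, a minimal sketch looks like the following. The helper name `extract_archive` and the destination folder are hypothetical, and the script assumes the zip file sits in the current directory.

```python
import zipfile
from pathlib import Path


def extract_archive(zip_path, dest_dir):
    """Extract zip_path into dest_dir and return any .vmx files found there."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest)
    # The .vmx file is the one you open in VMWare in a later step.
    return sorted(dest.rglob("*.vmx"))


if __name__ == "__main__":
    archive = Path("cloudera-quickstart-vm-5.12.0-0-vmware.zip")
    if archive.exists():
        for vmx in extract_archive(archive, "cloudera-quickstart-vm"):
            print(vmx)
    else:
        print(f"{archive} not found; adjust the path to your download folder")
```

Printing the “.vmx” path is handy because that is the file VMWare asks for when opening the virtual machine.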
Step 3: Open the installed VMWare Workstation Player, which is free for non-commercial use such as learning Cloudera.
Step 4: Open a virtual machine by selecting the previously extracted file “cloudera-quickstart-vm-5.12.0-0-vmware.vmx” as shown below.
Step 5: Click on edit settings and allocate at least “2 CPU Cores” and “8 GB” of RAM. Keep “4GB” of RAM for the host operating system and give the remainder to the VM. You can find your Windows system info via Start -> Run and then typing “msinfo32.exe”.
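The memory arithmetic in this step can be sketched as below. The function name is illustrative only, and the conversion to MB is included on the assumption that the settings dialog expects the value in MB.

```python
HOST_OS_RESERVE_GB = 4   # RAM kept back for Windows
MIN_VM_RAM_GB = 8        # minimum RAM for the Cloudera VM


def vm_ram_budget_gb(host_ram_gb):
    """Return the RAM (in GB) that can go to the VM, or raise if too little."""
    available = host_ram_gb - HOST_OS_RESERVE_GB
    if available < MIN_VM_RAM_GB:
        raise ValueError(
            f"{host_ram_gb} GB host leaves only {available} GB for the VM; "
            f"at least {MIN_VM_RAM_GB} GB is needed"
        )
    return available


if __name__ == "__main__":
    for host_gb in (12, 16, 32):
        vm_gb = vm_ram_budget_gb(host_gb)
        print(f"{host_gb} GB host -> up to {vm_gb} GB ({vm_gb * 1024} MB) for the VM")
```

For a 12 GB host this yields exactly the 8 GB minimum, which is why 16 GB or more is the more comfortable choice.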
Step 6: Click on play.
Step 7: Once it has started, you will see the following screen.
Step 8: Click on “Launch Cloudera Express”.
Step 9: Log in to Cloudera Manager with “cloudera” as both the username and the password.
Step 10: Start all the services.
Step 11: Start Hue, which is a web interface for HDFS, HBase, Spark UI, Hive, etc.
Step 12: Log in to Hue with “cloudera/cloudera”. The Hadoop Distributed File System (i.e. HDFS) is shown below.
Step 13: Log out of Hue and Cloudera Manager, and then shut down the VM.
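Services can take a while to come up after being started. A rough way to see whether the web interfaces are listening is a plain TCP check like the sketch below. It assumes the default ports (7180 for Cloudera Manager, 8888 for Hue) and a placeholder host name, and it needs Python 3, so run it from a machine that has Python 3 and can reach the VM.

```python
import socket

# Assumed default ports; adjust if your setup differs.
SERVICES = {"Cloudera Manager": 7180, "Hue": 8888}


def port_is_open(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    host = "localhost"  # placeholder; use the VM's address if checking remotely
    for name, port in SERVICES.items():
        state = "reachable" if port_is_open(host, port) else "not reachable"
        print(f"{name} on {host}:{port} is {state}")
```

An open port only shows that something is listening; logging in through the browser is still the real test.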