⏯ Getting started with Big Data on Cloudera

Note: Some of these modules are obsolete, especially the installation steps. They are therefore provided for free to complement the Big Data Interview Q&As, for those who want to refresh their memory on HDFS, Spark, Hive, HBase, etc.


Module 1: Installing & getting started with Cloudera QuickStart
Unit 1: Installing & getting started with Cloudera QuickStart on VMware for Windows in 17 steps
Unit 2: ⏯ Cloudera Hue, terminal window (on the edge node) & Cloudera Manager overview
Unit 3: Understanding Cloudera Hadoop users
Unit 4: Upgrading the Java version to JDK 8 in Cloudera QuickStart
Module 2: Getting started with HDFS on Cloudera
Getting started with HDFS (i.e. Hadoop Distributed File System)
Unit 1: ⏯ Hue and terminal window to work with HDFS
Unit 2: Java program to list files in HDFS & write to HDFS using Hadoop API
Unit 3: ⏯ Java program to list files on HDFS & write to a file in HDFS
Unit 4: Write to & read from a CSV file in HDFS using Java & Hadoop API
Unit 5: ⏯ Write to & read from HDFS using Hadoop API in Java
Module 3: Running an Apache Spark job on Cloudera
Running an Apache Spark job on the Cloudera Distribution including Hadoop (aka CDH)
Unit 1: Before running a Spark job on a YARN cluster in Cloudera
Unit 2: Running a Spark job on a YARN cluster in Cloudera
Unit 3: ⏯ Running a Spark job on a YARN cluster
Unit 4: Write to HDFS from Spark in YARN mode & local mode
Unit 5: ⏯ Write to HDFS from Spark in YARN & local modes
Unit 6: Spark running in YARN and local modes reading from HDFS
Unit 7: ⏯ Spark running in YARN and local modes reading from HDFS
Module 4: Hive on Cloudera
Unit 1: Getting started with Hive
Unit 2: ⏯ Getting started with Hive
Module 5: HBase on Cloudera
Unit 1: Write to HBase from Java
Unit 2: Read from HBase in Java
Unit 3: HBase shell commands to get, scan, and delete
Unit 4: ⏯ Write to & read from HBase
Module 6: Writing to & reading from Avro in Spark
Unit 1: Write to an Avro file from a Spark job in local mode
Unit 2: Read an Avro file from HDFS via a Spark job running in local mode
Unit 3: ⏯ Write to & read from an Avro file on HDFS using Spark
Unit 4: Write to HDFS as Avro from a Spark job using Avro IDL
Unit 5: ⏯ Write to Avro using Avro IDL from a Spark job
Unit 6: Create a Hive table over Avro data
Unit 7: ⏯ Hive table over an Avro folder & avro-tools to generate the schema
Module 7: Writing to & reading from Parquet in Spark
Unit 1: Write to a Parquet file from a Spark job in local mode
Unit 2: Read from a Parquet file in a Spark job running in local mode
Unit 3: ⏯ Write to and read from Parquet data on HDFS via Spark
Unit 4: Create a Hive table over Parquet data
Unit 5: ⏯ Hive over Parquet data
Module 8: Spark SQL
Unit 1: Read a Hive table with Spark SQL
Unit 2: Write to Parquet using Spark SQL & DataFrame
Unit 3: Read from Parquet with Spark SQL & DataFrame
Unit 4: ⏯ Spark SQL basics video tutorial
Module 9: Spark Streaming
Unit 1: Spark Streaming with text files
Unit 2: Spark file streaming in Java
Unit 3: ⏯ Spark Streaming video tutorial
