Blog Archives
1 2 3

01: Learn Hadoop API by examples in Java

These Hadoop tutorials assume that you have installed Cloudera QuickStart, which has the Hadoop eco system like HDFS, Spark, Hive, HBase, YARN, etc.

What is Hadoop & HDFS? Hadoop based data hub architecture & basics | Hadoop eco system basics Q&As style.

Read more ›



02: Learn Spark & AVRO Write & Read in Java by example

These Hadoop tutorials assume that you have installed Cloudera QuickStart, which has the Hadoop eco system like HDFS, Spark, Hive, HBase, YARN, etc. AVRO (i.e row oriented) and Parquet (i.e. column oriented) file formats are HDFS (i.e. Hadoop Distributed File System) friendly binary data formats as they store data compressed...

Members Only Content
Log In Register Home


02a: Learn Spark writing to Avro using Avro IDL

What is Avro IDL? Avro IDL (i.e. Interface Description Language) is a high-level language to write Avro schemata. You can generate Java, C++, and Python objects from the Avro IDL files. These files generally have the “.avdl” extension. Step 1: Write the “order.avdl” … Read more ›...

Members Only Content
Log In Register Home


03: Learn Spark & Parquet Write & Read in Java by example

These Hadoop tutorials assume that you have installed Cloudera QuickStart, which has the Hadoop eco system like HDFS, Spark, Hive, HBase, YARN, etc. AVRO (i.e row oriented) and Parquet (i.e. column oriented) file formats are HDFS (i.e. Hadoop Distributed File System) friendly binary data formats as they store data compressed...

Members Only Content
Log In Register Home


04: Learn how to connect to HBase from Spark using Java API

These Hadoop tutorials assume that you have installed Cloudera QuickStart, which has the Hadoop eco system like HDFS, Spark, Hive, HBase, YARN, etc. What is HBase? Apache HBase is a NoSQL database used for random and real-time read/write access to your Big Data. It is built on top of the...

Members Only Content
Log In Register Home


05: Learn Hive to write to and read from AVRO & Parquet files by examples

These Hadoop tutorials assume that you have installed Cloudera QuickStart, which has the Hadoop eco system like HDFS, Spark, Hive, HBase, YARN, etc. What is Apache Hive? Hive allows SQL developers to write Hive Query Language (HQL) statements that are similar to standard SQL statements. … Read more ›...

Members Only Content
Log In Register Home


06: Learn how to access Hive from Spark via SparkSQL & Dataframes by example

These Hadoop tutorials assume that you have installed Cloudera QuickStart, which has the Hadoop eco system like HDFS, Spark, Hive, HBase, YARN, etc. This example extends Learn Hive to write to and read from AVRO & Parquet files by examples to access Hive metastore via Spark SQL. … Read more...

Members Only Content
Log In Register Home


1 2 3

800+ Java Interview Q&As Menu

Learn by categories on the go...
Learn by categories such as FAQs – Core Java, Key Area – Low Latency, Core Java – Java 8, JEE – Microservices, Big Data – NoSQL, Architecture – Distributed, Big Data – Spark, etc. Some posts belong to multiple categories.
Top