Blog Archives

01: Learn Hadoop API by examples in Java

These Hadoop tutorials assume that you have installed Cloudera QuickStart, which includes Hadoop ecosystem components such as HDFS, Spark, Hive, …


02: Learn Spark & AVRO Write & Read in Java by example

These Hadoop tutorials assume that you have installed Cloudera QuickStart, which includes Hadoop ecosystem components such as HDFS, Spark, Hive, …


02a: Learn Spark writing to Avro using Avro IDL

What is Avro IDL? Avro IDL (Interface Description Language) is a high-level language for writing Avro schemata. You can generate …


03: Learn Spark & Parquet Write & Read in Java by example

These Hadoop tutorials assume that you have installed Cloudera QuickStart, which includes Hadoop ecosystem components such as HDFS, Spark, Hive, …


04: Learn how to connect to HBase from Spark using Java API

These Hadoop tutorials assume that you have installed Cloudera QuickStart, which includes Hadoop ecosystem components such as HDFS, Spark, Hive, …


05: Learn Hive to write to and read from AVRO & Parquet files by examples

These Hadoop tutorials assume that you have installed Cloudera QuickStart, which includes Hadoop ecosystem components such as HDFS, Spark, Hive, …


06: Learn how to access Hive from Spark via SparkSQL & Dataframes by example

These Hadoop tutorials assume that you have installed Cloudera QuickStart, which includes Hadoop ecosystem components such as HDFS, Spark, Hive, …


07: Learn Spark Dataframes to do ETL in Java with examples

These Hadoop tutorials assume that you have installed Cloudera QuickStart, which includes Hadoop ecosystem components such as HDFS, Spark, Hive, …


08: Learn how to convert an RDD to a Dataframe in Spark using Java with examples

These Hadoop tutorials assume that you have installed Cloudera QuickStart, which includes Hadoop ecosystem components such as HDFS, Spark, Hive, …


09: Running a Spark job on YARN cluster in Cloudera

This assumes that you have installed Cloudera QuickStart, which includes Hadoop ecosystem components such as HDFS, Spark, Hive, HBase, YARN, …


10: Solving AlreadyBeingCreatedException & LeaseExpiredException thrown from your Spark jobs

What is wrong with the following Spark code snippet? You are likely to get AlreadyBeingCreatedException & LeaseExpiredException thrown as multiple executors …


11: What are part- files in Hadoop & 6 ways to merge them

What are the part-xxxx files generated by Hadoop? When you invoke rdd.saveAsTextFile(…) or rdd.saveAsNewAPIHadoopFile(…) from Spark, you will get part- files. …


12: XML Processing in Spark with XmlInputFormat

Step 1: Read the XML snippet in between the tags “<Record>”. Upload this file to HDFS “/user/cloudera/xml/orders.xml”. Step 2: You need …

