01: Spark RDD joins in Scala tutorial

This tutorial extends "Setting up Spark and Scala with Maven".

Step 1: Let’s take a simple example of joining students to their departments. In SQL, this would be written as:

[Figure: joins in Spark RDDs]

Step 2: Let’s create classes to represent Student and Department data.
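
The original class definitions are not reproduced here, so the following is only a minimal sketch of what they might look like. The field names (id, name, departmentId), the sample rows, and the holder object StudentDepartmentData are all assumptions made for illustration. Case classes are used because they are serializable, which Spark needs when it ships records between tasks.

```scala
// Hypothetical data model: field names are assumed, not taken from the original post.
case class Department(id: Int, name: String)

// departmentId plays the role of a foreign key pointing at Department.id.
case class Student(id: Int, name: String, departmentId: Int)

// Illustrative sample rows (made up) that a job could later turn into RDDs.
object StudentDepartmentData {
  val departments: Seq[Department] = Seq(
    Department(1, "Science"),
    Department(2, "Arts")
  )

  val students: Seq[Student] = Seq(
    Student(1, "John", 1),
    Student(2, "Peter", 2),
    Student(3, "Sam", 1)
  )
}
```

Every student row carries a departmentId but no department name, which is why a join is needed to produce a readable result.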

Step 3: The Spark job with an RDD join.
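
The original job listing is not reproduced here, so the following is a sketch of one way such a job could be written, not the author's code. It assumes spark-core is on the classpath (as set up with Maven) and repeats minimal, hypothetical Student and Department classes so that it compiles on its own. The object name RddJoinExample, the helper joinStudentsToDepartments, the field names and the sample rows are all illustrative assumptions. The approach is to key both RDDs by department id with keyBy and then call join, which is available on RDDs of key/value pairs.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.rdd.RDD

object RddJoinExample {
  // Hypothetical data model; field names are assumptions.
  case class Department(id: Int, name: String)
  case class Student(id: Int, name: String, departmentId: Int)

  // Rough SQL equivalent under the assumed column names:
  //   SELECT s.name, d.name
  //   FROM student s JOIN department d ON s.department_id = d.id
  def joinStudentsToDepartments(
      students: RDD[Student],
      departments: RDD[Department]): RDD[(String, String)] = {
    // join works on pair RDDs, so key both sides by the department id.
    val studentsByDept: RDD[(Int, Student)] = students.keyBy(_.departmentId)
    val departmentsById: RDD[(Int, Department)] = departments.keyBy(_.id)

    // Inner join: only students whose departmentId matches a department are kept.
    studentsByDept.join(departmentsById).map {
      case (_, (student, department)) => (student.name, department.name)
    }
  }

  def main(args: Array[String]): Unit = {
    // local[*] runs Spark inside this JVM; a cluster would use a different master.
    val conf = new SparkConf().setAppName("rdd-join-example").setMaster("local[*]")
    val sc = new SparkContext(conf)
    try {
      // Made-up sample rows for illustration.
      val students = sc.parallelize(Seq(
        Student(1, "John", 1), Student(2, "Peter", 2), Student(3, "Sam", 1)))
      val departments = sc.parallelize(Seq(
        Department(1, "Science"), Department(2, "Arts")))

      // Row order after a join is not guaranteed, so sort before printing.
      joinStudentsToDepartments(students, departments)
        .collect()
        .sortBy(_._1)
        .foreach(println)
    } finally {
      sc.stop()
    }
  }
}
```

join is an inner join, so a student with an unknown departmentId would be dropped from the result. leftOuterJoin on the student side would keep such students and pair them with an Option for the department.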

Q. Why join with a department?
A. To get the department name.

Output:

