Before running a Spark job on a YARN cluster in Cloudera and about the Spark history server

Problem: When you run a Spark job via “spark-submit” command on a “YARN” cluster as shown below in a terminal,

It creates a folder and files in HDFS under

The files will be named with “application ids” like…

Members Only Content

Log In Register Home

Categories Menu - Q&As, FAQs & Tutorials