Blog Archives

09: Q76 – Q88 Apache Hive partitioning, bucketing, map join, skewed data, vectorization & denormalization interview Q&As

This extends Apache Hive basics interview questions & answers. Q76. What are the 3 key considerations when processing data in HDFS compared to a traditional RDBMS or EDW (i.e. Enterprise Data Warehouse)? A76. 1) partitioning, 2) bucketing, and 3) denormalization. Let’s look at these concepts in Hive with examples. Q77. Can…


10 Apache Hive scenarios interview Q&As on backup, repair, partition, drop, etc

Q01: How do you back up an external table with data? A01. You can use a CTAS (i.e. CREATE TABLE AS SELECT) statement.
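A minimal sketch of such a CTAS backup is shown here for illustration. The table names `sales_ext` and `sales_backup` are hypothetical, and ORC is an assumed storage format rather than anything the original post specifies. Note that in Hive the target of a CTAS statement cannot itself be an external table, so the backup copy is created as a managed table.

```sql
-- Hypothetical names: sales_ext is the external table being backed up,
-- sales_backup is the copy. Neither name comes from the original post.
-- Hive CTAS creates a managed table; the target cannot be EXTERNAL.
CREATE TABLE sales_backup
STORED AS ORC          -- assumed format; choose whatever suits your cluster
AS
SELECT * FROM sales_ext;

-- Optional sanity check: the two row counts should match.
SELECT COUNT(*) FROM sales_ext;
SELECT COUNT(*) FROM sales_backup;
```

Because the copy is a managed table, dropping `sales_backup` later removes its data as well, which is usually what you want for a throwaway backup.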

Q02: What if the table you are backing up is partitioned? A02. You can check if the table being backed up is partitioned with…


13: Q98 – Q104 Apache Hive Basics Interview questions & answers

Q98. What is Hive? A98. Hive is used for accessing and analyzing data in Hadoop using an SQL-like syntax known as HiveQL. Q99. What is the difference between Hive internal tables & external tables? A99. When you drop an internal table, it drops the data, and it also…


Apache Hive for Slowly Changing Dimension (i.e. SCD) interview Q&As

Q1. What is a Slowly Changing Dimension (i.e. SCD)? A1. An SCD is a dimension that changes slowly over time, rather than on a regular basis. For example, a change in a customer's name or address. There are different types of changing dimensions, and type 1 & type 2 are the most common…


Q01 – Q07 Scenarios Based Hive Interview Q&As

Q1. Suppose you have a Hive table partitioned by ship_type, where the ship_type can be DIRECT or PICK_UP. The last job run to populate the DIRECT shipment data for the month of “2021 Feb” has caused data corruption. How will you go about rolling back the data load & rerunning the…


Q08 – Q12 Scenarios Based Hive Query Language (i.e. HQL) Interview Q&As

This extends Q01 – Q07 Scenarios Based Hive Query Language (i.e. HQL) Interview Q&As. Q08. When you have a requirement to replicate all the data from a particular source table to a target table, how will you go about it? A08. There are a number of approaches as outlined below…

