Blog Archives

09: Q76– Q88 Apache Hive partitioning, bucketing, map join, skewed data, vectorization & denormalization interview Q&As

This extends Apache Hive basics interview questions & answers. Q76. What are the 3 key considerations in processing data in HDFS compared to traditional RDBMS or EDW (i.e. Enterprise Data Warehouse)? AQ76. 1) partitioning, 2) bucketing, … Read more ›...

Members Only Content
Log In Register Home


13: Q98 – Q104 Apache Hive Basics Interview questions & answers

Q98. What is Hive? A98. Hive is used for accessing and analyzing data in Hadoop using SQL syntax. It is known as the HiveQL. Q99. What is the difference between Hive internal tables & external tables? A99. … Read more ›...

Members Only Content
Log In Register Home


Apache Hive for Slowly Changing Dimension (i.e. SCD) interview Q&As

Q1. What is a Slowly Changing Dimension (i.e. SCD)? A1. SCD means the dimensions that change slowly over time, rather than changing on regular basis. For example, change in customer name or address. There are different types of changing dimensions, and type 1 & … Read more ›...

Members Only Content
Log In Register Home


Q01 – Q07 Scenarios Based Hive Query Language (i.e. HQL) Interview Q&As

Q1. Given you have a Hive table partitioned by ship_type where the ship_type can be DIRECT or PICK_UP. The last job run to populate the DIRECT shipment data for the month of “2021 Feb” has caused a data corruption. How ill you go about rollback data load & … Read...

Members Only Content
Log In Register Home


800+ Java Interview Q&As Menu

Learn by categories on the go...
Learn by categories such as FAQs – Core Java, Key Area – Low Latency, Core Java – Java 8, JEE – Microservices, Big Data – NoSQL, Architecture – Distributed, Big Data – Spark, etc. Some posts belong to multiple categories.
Top