Blog Archives

00: 13 Data Warehouse interview Q&As – Fact Vs Dimension, CDC, SCD, etc – part 1

Q1. What is dimensional modelling in a Data Warehouse (i.e. DWH)?
A1. A dimensional model is a data structure technique optimised for Data Warehousing tools (i.e. OLAP products). Dimensional modelling comprises Fact and Dimension tables.

A “Fact” is a numeric value (also known as a measure) that a business wishes to count or sum.…



00: 13 Data Warehouse interview Q&As – Fact Vs Dimension, CDC, SCD, etc – part 2

This extends Q1 to Q5 at 13 Data Warehouse interview Q&As – Fact Vs Dimension, CDC, SCD, etc – part 1.

Q6. What is a Factless Fact table?
A6. From…



01: Data Backfilling interview questions & answers

Q1. What is data backfilling?
A1. Backfilling data is a process of reactively processing any missing data for a past time window.

Q2. Why do you need to backfill data?…



03: Databricks – Spark SCD Type 1

Prerequisite: Extends Databricks getting started – Spark, Shell, SQL.

What is SCD Type 1

SCD stands for Slowly Changing Dimension, and it was explained in 10 Data warehouse interview Q&As.

Step 1: Remove all cells in the notebook with the “x” and then confirm or create a new Python notebook.…



04: Databricks – Spark SCD Type 2

Prerequisite: Extends 03: Databricks – Spark SCD Type 1.

What is SCD Type 2

SCD stands for Slowly Changing Dimension, and it was explained in 10 Data warehouse interview Q&As….



04a: Databricks – Spark SCD Type 1 with Merge

Prerequisite: Extends 03: Databricks – Spark SCD Type 1.

What is SCD Type 1

SCD stands for Slowly Changing Dimension, and it was explained in 10 Data warehouse interview Q&As….



04b: Databricks – Spark SCD Type 2 with Merge

Prerequisite:…



05: Q37-Q41 – Data lake & metadata interview Q&As

Q37. What is a Data Lake?
A37. A data lake is a storage repository that holds a vast amount of structured, semi-structured, and unstructured raw data in its native format…



10 ERD (Entity-Relationship Diagrams) Interview Q&As

Q01. Can you describe a business domain of a Telecom company offering multiple services to its customers?
A01. A Telecom company will have entities such as Customer, Account, Subscriptions &…



Apache Hive for Slowly Changing Dimension (i.e. SCD) interview Q&As

Q1. What is a Slowly Changing Dimension (i.e. SCD)?
A1. SCD refers to dimensions that change slowly over time, rather than on a regular basis. For example, change in customer…


