Blog Archives

00: 13 Data Warehouse interview Q&As – Fact Vs Dimension, CDC, SCD, etc – part 1

Q1. What is dimensional modelling in a Data Warehouse (i.e. DWH)?
A1. A dimensional model is a data structure technique optimised for Data Warehousing tools (i.e.



00: 13 Data Warehouse interview Q&As – Fact Vs Dimension, CDC, SCD, etc – part 2

This extends Q1 to Q5 in 13 Data Warehouse interview Q&As – Fact Vs Dimension, CDC, SCD, etc – part 1.
Q6. …



01: Data Backfilling interview questions & answers

Q1. What is data backfilling?
A1. Backfilling data is the process of retroactively processing any missing data for a past time window. …



03: Databricks – Spark SCD Type 1

Prerequisite: Extends Databricks getting started – Spark, Shell, SQL.

What is SCD Type 1

SCD stands for Slowly Changing Dimension,



04: Databricks – Spark SCD Type 2

Prerequisite: Extends 03: Databricks – Spark SCD Type 1.

What is SCD Type 2

SCD stands for Slowly Changing Dimension, …



04a: Databricks – Spark SCD Type 1 with Merge

Prerequisite: Extends 03: Databricks – Spark SCD Type 1.

What is SCD Type 1

SCD stands for Slowly Changing Dimension, …



04b: Databricks – Spark SCD Type 2 with Merge

Prerequisite: …



05: Q37-Q41 – Data lake & metadata interview Q&As

Q37. What is a Data Lake?
A37. A data lake is a storage repository that holds a vast amount of structured, semi-structured, and unstructured raw data in its native format...



10 ERD (Entity-Relationship Diagrams) Interview Q&As

Q01. Can you describe a business domain of a Telecom company offering multiple services to its customers?
A01. A Telecom company will have entities such as Customer, …



Apache Hive for Slowly Changing Dimension (i.e. SCD) interview Q&As

Q1. What is a Slowly Changing Dimension (i.e. SCD)?
A1. SCD refers to dimensions that change slowly over time, rather than on a regular basis. …



Canonical Data Model (i.e. CDM) interview Q&As

Q01. What do you understand by the term canonicalizing?
A01. Canonicalizing is the activity of replacing multiple copies of an object/entity/URL with just a few objects/entities/URLs. …



Data categories in Data warehouse & Data lake

Q01. What are the different types of data that get stored in a data lake or data warehouse?
A01. An enterprise stores different types of data. …



SQL querying Temporal tables & SCD Type 2

Q1. What is a temporal table?
A1. Temporal tables store and maintain information with respect to time. …


