Blog Archives

01: Q01 – Q07 General Big Data, Data Science & Data Analytics Interview Q&As

Q01. How is Big Data used in industries?
A01. The main goal for most organisations is to enhance customer experience, and consequently increase sales. The other goals include cost reduction, better targeted marketing, fraud detection, identifying data breaches to enhance security, making existing processes more efficient,

Read more ›



02: Cleansing & pre-processing data in BigData & machine learning with Spark interview Q&As

Q1. Why are data cleansing & pre-processing important in analytics & machine learning? A1. Garbage in gets you garbage out. No matter how good your machine learning algorithm is. Q2. What are the general steps of cleansing data A2. … Read more ›...

Members Only Content
Log In Register Home


03: Simple Linear Regression interview Q&As

Q01. What is a gradient? A01. In algebra we can represent a straight line with: y = mx + c A parabola is represented as: y = m1x2 + m2x + c, and so on. … Read more ›...

Members Only Content
Log In Register Home


04: Residuals, Cost/Loss functions, R-squared & Gradient Descent interview Q&As

Q01. What do you understand by the terms mean, variance, and standard deviation of the sample Vs. the population? A01. Given that the following are the number of job applications sent by 6 individuals: Where X is the Sample. … Read more ›...

Members Only Content
Log In Register Home


05: Linear regression outputs, null hypothesis, t-test & p-value interview Q&As

Q1. How do you produce & interpret Linear Regression output? A1. Scatter plots can only detect obvious relationships between variables by looking at the graph, but we can use statistics to comment about the variable relationships as outlined below. The link 11A: Databricks – Spark ML – Pandas Dataframe &...

Members Only Content
Log In Register Home


800+ Java Interview Q&As Menu

Learn by categories on the go...
Learn by categories such as FAQs – Core Java, Key Area – Low Latency, Core Java – Java 8, JEE – Microservices, Big Data – NoSQL, Architecture – Distributed, Big Data – Spark, etc. Some posts belong to multiple categories.
Top