800+ Java Interview Questions & Answers for 2 to 5 and 5 to 10+ years of experience & architects



500+ Big Data Interview Questions & Answers for 2 to 5 and 5 to 10+ years of experience & architects



11 Tips to have a rewarding career in Software & Data Engineering

A rewarding career should offer not only monetary rewards but also much-needed non-monetary ones, such as work-life balance, a sense of accomplishment, pride in building significant systems, progression, learning opportunities, building networks, making a difference, and mentoring.

The key takeaways of this post are:

1) Taking the road less travelled.
2) Prompting for a reality check.

#01 Balanced routine & consistency

Many so-called influencers advise working very hard and learning new technologies to earn 2x or 3x your salary.…



29: NumPy Vs Pandas interview questions & answers – part 3

Q01. What is an index in Pandas? A01. An index is a sequence of labels that identify the rows of a DataFrame. Labels are usually unique, although pandas does not require it. Index labels can be of any hashable type, such as integers, strings, or timestamps.
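As a minimal sketch of what an index does (the column names, values, and the labels "e1" to "e3" are hypothetical and not taken from the original post), the snippet below builds a small DataFrame with string labels and looks a row up by label:

```python
import pandas as pd

# Hypothetical sample data, used only for illustration.
df = pd.DataFrame(
    {"emp_name": ["John", "Elliot", "Sam"], "salary": [5000, 6000, 5500]},
    index=["e1", "e2", "e3"],  # string labels instead of the default 0..n-1
)

# Label-based lookup goes through the index.
elliot_salary = df.loc["e2", "salary"]

# The index is an object of its own and can report whether its labels are unique.
index_labels = list(df.index)
labels_are_unique = df.index.is_unique

print(elliot_salary, index_labels, labels_are_unique)
```

If no index is passed, pandas assigns a default integer index starting at 0.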



28: NumPy Vs Pandas interview questions & answers – part 2

Q01. What is the difference between a Python List & NumPy? A01. Even though both superficially look the same with contents and indexes starting from 0 as shown below:
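The original answer is truncated at this point. As a hedged illustration of one commonly cited difference (the sample values are assumptions, not the post's own example), the sketch below shows that a list and a NumPy array index the same way but respond differently to arithmetic operators:

```python
import numpy as np

py_list = [1, 2, 3]
np_array = np.array([1, 2, 3])

# Both are indexed from 0, so they look alike on the surface.
first_items = (py_list[0], int(np_array[0]))

# "*" repeats a list but multiplies an array element by element.
list_times_two = py_list * 2
array_times_two = np_array * 2

# "+" concatenates lists but adds the scalar to every array element.
list_plus = py_list + [4]
array_plus = np_array + 4

print(list_times_two, array_times_two.tolist())
print(list_plus, array_plus.tolist())
```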



22: PySpark Row object Interview Q&As with tutorials

This extends PySpark map vs flatMap Interview Q&As with tutorials. Q01. What is a Row object in PySpark? A01. Row can be used to create a row object by using named arguments. The previous example can be represented with a Row object. Given the below input, how will you concat…



21: PySpark map vs flatMap Interview Q&As with tutorials

Q01. What is the difference between map and flatMap operations in Spark? A01. Both map and flatMap are transformation operations in Spark. The map transformation is applied to each element of an RDD or a DataFrame and returns the result as a new RDD or a DataFrame. Map takes N elements…
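The semantics can be sketched in plain Python without a Spark cluster. This is only an analogue under stated assumptions: the input lines and the helper `split_words` are hypothetical, and list comprehensions stand in for Spark's `map` and `flatMap` transformations.

```python
from itertools import chain

# Hypothetical input; in Spark this would be the contents of an RDD.
lines = ["hello world", "map vs flatMap"]


def split_words(line):
    """Split one line into a list of words."""
    return line.split(" ")


# map analogue: exactly one output element per input element (N in, N out).
mapped = [split_words(line) for line in lines]

# flatMap analogue: apply the same function, then flatten the results by one
# level, so the number of output elements can differ from the input count.
flat_mapped = list(chain.from_iterable(split_words(line) for line in lines))

print(mapped)
print(flat_mapped)
```

With two input lines, the map analogue yields two lists of words, while the flatMap analogue yields a single flat list of five words.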



20: PySpark Dataframe Vs Pandas Dataframe Interview Q&As with tutorials

This extends NumPy Vs Pandas interview questions & answers – part 1. Q: What is the difference between a PySpark Dataframe & a Pandas Dataframe? A: PySpark is a library whose Dataframe operations are typically quicker than those of the Pandas Dataframe library on large datasets because of its parallel execution over multiple CPU cores & distributed in…



27: NumPy Vs Pandas interview questions & answers – part 1

Q01. What is the difference between NumPy & Pandas? A01. Firstly, pandas, a Python library for data wrangling, is built on top of the NumPy Python library. NumPy is short for Numerical Python and is used for scientific computing. This library is made up of multidimensional array objects and…



19: PySpark on handling duplicates Interview Q&As with tutorials

Given the below data, where the record with the emp_name “Elliot” is repeated.

distinct(..)…


