Mar 1, 2024
My journey into the quantum world began with a simple question: how can something be in two places simultaneously? This seemingly nonsensical notion, central…
Dec 21, 2023
Renewable energy only accounts for around 11% of global energy production. This is a staggeringly low figure, considering that if we are to reach…
Nov 14, 2023
You might be familiar with Secretary of Transportation Pete Buttigieg. This 41-year-old former mayor of South Bend was born into a relatively comfortable life…
Nov 4, 2023
In the previous story, you learned to set up Ubuntu 20.04 on Windows 10 as Linux Subsystem Distribution. In this article, I will walk you through the…
Oct 28, 2023
On October 12, 2023, a significant event took place at the LinkedIn office in Bangalore, Karnataka. The Hadoop MeetUp featured a variety of engaging talks and…
Oct 27, 2023
Setting up Apache in a Docker container on Ubuntu 22.04 can be a straightforward process if you follow the step-by-step tutorial below. Docker allows…
Oct 26, 2023
Spill problem happens when the moving of an RDD (resilient distributed dataset, aka fundamental data structure in Spark) moves from RAM to disk and then…
Oct 26, 2023
Spill problem happens when the moving of an RDD (resilient distributed dataset, aka fundamental data structure in Spark) moves from RAM to disk and then…
Oct 26, 2023
The closer your data product is getting to the production, the bigger is the importance of properly collecting and analysing logs. Logs help both…
Oct 26, 2023
The answer is No. It last only for the duration of spark’s application run (till the time cluster is up in Databricks). As soon…
Oct 26, 2023
I have used Apache Spark (often as Azure Databricks) for some years and see it as a sledgehammer in data processing. It is a…
Oct 26, 2023
Databricks is a ~$40B company built around the open-source distributed computation engine Apache Spark. Their core offering is a high-level interface that allows organizations…