Amar Prakash Pandey – Medium

From Bottlenecks to Balance: Dynamic Skew Join Fixes in Spark

When working with large datasets in Spark, joins are a common operation. But what happens when data distribution isn’t uniform? Let’s dive…

Apr 13, 2025

Fine-Tuning Shuffle Partitions in Apache Spark for Maximum Efficiency

Apache Spark’s shuffle partitions are critical in data processing, especially during operations like joins and aggregations. Properly…

Feb 12, 2025

Apache Spark SQL Engine and Query Planning

Apache Spark is a powerful distributed computing framework that provides two interfaces for working with data:

Apr 6, 2025

Deep Dive into Apache Spark Jobs and Stages

Understanding how jobs and stages work is crucial to optimizing performance with large-scale data processing using Apache Spark. This blog…

Mar 24, 2025

Finger Detection and Tracking using OpenCV and Python

TL;DR. Code is here.

Jul 28, 2018

What is Google Summer of Code? How to prepare for it?

We will talk about Google Summer of Code, but before that, let’s talk about what open source development is. Yes, it’s very important.

Jul 2, 2017
