Posts

All the articles I've posted.

How to Fix 'DataFrame' object has no attribute 'writeTo' When Working with Apache Iceberg in PySpark

17 Jun, 2025

If you’re working with Apache Iceberg in PySpark and encounter the error "Failed to write to Iceberg table: 'DataFrame' object has no attribute 'writeTo'", you’re not alone. This is a common mistake
What Is Sharding and Why It Matters

21 May, 2025

As our world becomes increasingly digital, the amount of data we create every day is staggering. Think about all the emails, messages, orders, and photos uploaded every second. How do big companies
From Tables to Partitions: Designing NoSQL Databases with Cassandra

20 May, 2025

As data professionals transition from relational databases to NoSQL systems like Apache Cassandra, one of the most important mindset shifts is understanding that you don't model data for storage, but
Apache Cassandra vs Apache Parquet: Understanding the Differences

14 May, 2025

In modern data architectures, it's common to encounter both Apache Cassandra and Apache Parquet, particularly when dealing with large-scale, distributed systems. Both technologies are associated with
Import Live Crypto Prices into Google Sheets

11 May, 2025

Are you tired of checking crypto prices manually? Want to automate your portfolio tracking or build a custom crypto dashboard? Good news — with just a few steps, you can pull live cryptocurrency
Fixing Spark Ivy Error in Docker: "basedir must be absolute"

9 May, 2025

If you're running Apache Spark inside Docker using Bitnami's images and suddenly encounter an Ivy error that says: Exception in thread "main" java.lang.IllegalArgumentException: basedir must be
How Dynamo Reshaped the Internal Architecture of Amazon S3

9 May, 2025

Amazon S3 launched in 2006 as a scalable, durable object storage system. It avoided hierarchical file systems and used flat key-based addressing from day one. However, early versions of
What’s Behind Amazon S3?

9 May, 2025

When you upload a file to the cloud using an app or service, there's a good chance it's being stored on Amazon S3 (Simple Storage Service). But what powers it under the hood? What is Amazon S3? Amazon
