Posts

All the articles I've posted.

Why Parquet Became the Standard for Analytics

23 Sep, 2025

In the early days of Big Data, data was often stored in simple formats such as CSV, JSON, or text logs. While these formats were easy to generate and understand, they quickly became inefficient at
Facebook and Big Data: The Open Source Projects That Changed the Industry

19 Sep, 2025

When people talk about the history of Big Data, a few companies come to mind: Google, Yahoo, and Facebook. Each of them faced unique challenges that forced them to build large-scale distributed
HDFS vs. Object Storage: The Battle for Distributed Storage

19 Sep, 2025

Distributed storage has always been the foundation of Big Data. In the early days, Hadoop Distributed File System (HDFS) was the de facto standard. Today, however, object storage systems like Amazon
The History of Hive and Trino: From Hadoop to Lakehouses

19 Sep, 2025

The evolution of Big Data architectures is deeply tied to the history of two projects born at Facebook: Hive and Trino . Both emerged from real engineering pain points, but at different times and for
What Is a Data Lake and What Is a Data Lakehouse?

19 Sep, 2025

Over the last decade, the world of data architecture has gone through several transformations. From traditional data warehouses to Hadoop-based data lakes and now to the emerging Lakehouse paradigm,
Google Bigtable vs. Amazon DynamoDB: Understanding the Differences

17 Sep, 2025

When choosing a NoSQL database for scalable, low-latency applications, two major options stand out: Google Cloud Bigtable and Amazon DynamoDB . While both are managed, highly available, and
How to Keep a Docker Container Running Persistently

17 Sep, 2025

When working with Docker, you may have noticed that some containers stop as soon as you exit the shell. This is because Docker considers the container's main process to have finished. In this post, we
Fixing Cursor Login Issues on Linux (AppImage)

16 Sep, 2025

When running Cursor on Linux, especially with the AppImage version, you might encounter a situation where you can’t log in. This usually happens because Cursor stores its session state locally, and

Posts

Why Parquet Became the Standard for Analytics

Facebook and Big Data: The Open Source Projects That Changed the Industry

HDFS vs. Object Storage: The Battle for Distributed Storage

The History of Hive and Trino: From Hadoop to Lakehouses

What Is a Data Lake and What Is a Data Lakehouse?

Google Bigtable vs. Amazon DynamoDB: Understanding the Differences

How to Keep a Docker Container Running Persistently

Fixing Cursor Login Issues on Linux (AppImage)