Posts

All the articles I've posted.

Running PySpark on Google Colab: Do You Still Need findspark?

5 Apr, 2025

Introduction For a long time, using Apache Spark in Google Colab required manual setup, including installing Spark and configuring Python to recognize it. This was often done using the findspark
Testing Apache Airflow DAGs: A Modular Approach

5 Apr, 2025

Introduction Apache Airflow is a powerful workflow automation tool, but testing DAGs can be challenging due to their dependency on the Airflow scheduler and execution environment. In this post, we
Visualizing EXPLAIN ANALYZE in PostgreSQL

4 Apr, 2025

When working with PostgreSQL, understanding how queries execute can greatly improve performance tuning and optimization. PostgreSQL provides the EXPLAIN ANALYZE command to help developers analyze
Enabling Internet Access for Resources in a Public Subnet

3 Apr, 2025

When deploying resources in a public subnet within an AWS Virtual Private Cloud (VPC), you need to configure several components to allow them to communicate with the internet. Below are the essential
Network Address Translation (NAT): Overcoming IPv4 Shortages

3 Apr, 2025

Introduction Network Address Translation (NAT) is a technology designed to mitigate the shortage of IPv4 addresses by allowing multiple devices on a private network to share a limited number of public
Understanding Subnets, Gateways, and Route Tables in AWS

3 Apr, 2025

When designing applications in AWS, it's crucial to understand how networking components interact within a Virtual Private Cloud (VPC). This post will cover subnets, gateways, and route tables,
Generating a Calendar Table in Power Query (M Language)

2 Apr, 2025

When working with Power BI or other Power Query-supported tools, having a well-structured calendar table is essential for time-based analysis. In this blog post, we will walk through an M Language
How to Display an Error in Excel When More Than 5 "FALSE" Values Appear in a Row

2 Apr, 2025

Introduction When working with data in Excel, there may be instances where you need to monitor certain conditions and flag errors based on specific criteria. In this guide, we'll walk through a simple

Posts

Running PySpark on Google Colab: Do You Still Need findspark?

Testing Apache Airflow DAGs: A Modular Approach

Visualizing EXPLAIN ANALYZE in PostgreSQL

Enabling Internet Access for Resources in a Public Subnet

Network Address Translation (NAT): Overcoming IPv4 Shortages

Understanding Subnets, Gateways, and Route Tables in AWS

Generating a Calendar Table in Power Query (M Language)

How to Display an Error in Excel When More Than 5 "FALSE" Values Appear in a Row