Tag: Notes
All the articles with the tag "Notes".
-
The Enduring Relevance of Peter Chen’s Entity-Relationship Model
In the landscape of data modeling, few contributions have had the long-lasting impact of Peter Chen’s Entity-Relationship (E-R) Model, introduced in 1976. More than four decades later, it remains a
-
EMR vs AWS Glue: Choosing the Right Data Processing Tool on AWS
When working with big data on AWS, two commonly used services for data processing are Amazon EMR and AWS Glue. Although both support scalable data transformation and analytics, they differ
-
How Hadoop Made Specialized Storage Hardware Obsolete
In the early 2000s, enterprise data processing was dominated by high-end hardware. Organizations relied heavily on centralized storage systems such as SAN (Storage Area Networks) and NAS (Network
-
When Should You Use Iceberg with Athena? Partitioning Strategies and Best Practices
As data lakes grow in size and complexity, tools like Amazon Athena combined with table formats like Apache Iceberg become essential for scalability, data governance, and performance. In this post,
-
How Google Changed Big Data: The Story of GFS, MapReduce, and Bigtable
In the early 2000s, Google faced a unique challenge: how to store, process, and query massive amounts of data across thousands of unreliable machines. The traditional systems of the time—designed for
-
Secure Database Access in AWS Using SSH Tunneling
Accessing databases located in private subnets within AWS Virtual Private Clouds (VPCs) is a common requirement in enterprise architectures. To ensure secure connectivity without exposing the database
-
Did Early Personal Computers Really Have a CPU? A Look at the von Neumann Architecture
When we think of a personal computer (PC), we typically imagine a processor, memory, a keyboard, and a display. But a deeper question often goes unasked: Did all early personal computers actually
-
How Network Topology Shapes Distributed Computing and Big Data Systems
When discussing distributed systems and Big Data, people often focus on storage, processing frameworks, and scalability, but one foundational concept underlies it all: network topology. It’s the