Tag: Notes
All the articles with the tag "Notes".
-
From HDFS to S3: The Evolution of Data Lakes in the Cloud
For years, HDFS (Hadoop Distributed File System) was the default choice for building data lakes in on-premises and Hadoop-based environments. But as cloud computing gained momentum, a new player took
-
The History and Evolution of Amazon S3: Was It Ever Based on HDFS?
When discussing cloud storage today, Amazon S3 is almost synonymous with scalable, reliable object storage. However, a common question among those familiar with big data technologies like Hadoop is:
-
OLTP vs. OLAP: How JOINs and Efficiency Shape Their Differences
Online Transaction Processing (OLTP) and Online Analytical Processing (OLAP) are two distinct database architectures, each designed for different purposes. One key factor that differentiates them is
-
The Origins of OLTP and OLAP: A Brief History
Online Transaction Processing (OLTP) and Online Analytical Processing (OLAP) are fundamental concepts in database management, each serving distinct purposes. But when did these terms first appear, and
-
Enabling Internet Access for Resources in a Public Subnet
When deploying resources in a public subnet within an AWS Virtual Private Cloud (VPC), you need to configure several components to allow them to communicate with the internet. Below are the essential
-
Network Address Translation (NAT): Overcoming IPv4 Shortages
Introduction Network Address Translation (NAT) is a technology designed to mitigate the shortage of IPv4 addresses by allowing multiple devices on a private network to share a limited number of public
-
Why OLTP Systems Don't Retain Historical Changes
Online Transaction Processing (OLTP) systems are designed for high-speed transactions and efficient data management. However, one of their characteristics is that they do not retain historical changes
-
Understanding the Relationship Between Database Replication and the CAP Theorem
Introduction Database replication is a fundamental strategy in distributed systems that ensures data is duplicated across multiple nodes. However, when designing a replicated database, one must