Tag: Notes
All the articles with the tag "Notes".
-
Sending Events to Multiple PostHog Projects from the Same Website
In some architectures, a single website needs to send analytics events to multiple PostHog projects. This situation commonly appears in the following scenarios: Environment separation (development,
-
Lambda vs n8n: A Simple Explanation for Data Workflows
Introduction When building data systems or integrating APIs, a common question appears: should we use AWS Lambda or n8n? Both tools can automate processes, call APIs, and move data between systems,
-
Should You Use AWS Lambda or AWS Glue to Update Records in HubSpot?
When integrating HubSpot with a data platform on AWS, a common architectural decision appears quickly: Should updates to HubSpot be executed from AWS Lambda or AWS Glue? The correct choice depends on
-
Can You Know the Location of an IPv6 Address?
Example IPv6: 2600:100e:b0c7:7403:f88c:92d0:bc41:46ff Short answer: only approximately , and with significant limitations. This article explains what can and cannot be inferred from an IPv6 address,
-
HDFS vs. Object Storage: The Battle for Distributed Storage
Distributed storage has always been the foundation of Big Data. In the early days, Hadoop Distributed File System (HDFS) was the de facto standard. Today, however, object storage systems like Amazon
-
The History of Hive and Trino: From Hadoop to Lakehouses
The evolution of Big Data architectures is deeply tied to the history of two projects born at Facebook: Hive and Trino . Both emerged from real engineering pain points, but at different times and for
-
What Is a Data Lake and What Is a Data Lakehouse?
Over the last decade, the world of data architecture has gone through several transformations. From traditional data warehouses to Hadoop-based data lakes and now to the emerging Lakehouse paradigm,
-
Incremental Data Loads: Choosing Between resource_version and created_at/updated_at
Incremental data loading is a cornerstone of modern data engineering pipelines. Instead of re-ingesting entire datasets on each execution, incremental strategies focus on retrieving only records that