Tag: Apache Iceberg
All the articles with the tag "Apache Iceberg".
-
Choosing Between saveAsTable and Iceberg’s writeTo in AWS Glue and Athena
When working with Spark on AWS Glue , there are multiple ways to persist DataFrames as tables and make them queryable in Amazon Athena . Two common approaches are: Using Spark’s Hive-style saveAsTable
-
Optimizing Partition Strategies in Apache Iceberg on AWS
When working with large-scale analytical datasets, efficient partitioning is critical for achieving optimal query performance and cost savings. Apache Iceberg, a modern table format designed for big
-
How Transactions Work in Databricks Using Delta Lake
Databricks is a powerful platform for big data analytics and machine learning. One of its key features is the ability to run transactional workloads over large-scale data lakes using Delta Lake . This
-
When Should You Use Parquet and When Should You Use Iceberg?
In modern data architectures, selecting the right storage and management solution is essential for building efficient, reliable, and scalable pipelines. Two popular choices that often come up are
-
How to Fix 'DataFrame' object has no attribute 'writeTo' When Working with Apache Iceberg in PySpark
If you’re working with Apache Iceberg in PySpark and encounter this error: Failed to write to Iceberg table: 'DataFrame' object has no attribute 'writeTo' You’re not alone. This is a common mistake