Tag: Apache Iceberg

All the articles with the tag "Apache Iceberg".

Choosing Between saveAsTable and Iceberg’s writeTo in AWS Glue and Athena

3 Sep, 2025

When working with Spark on AWS Glue, there are multiple ways to persist DataFrames as tables and make them queryable in Amazon Athena. Two common approaches are: Using Spark’s Hive-style saveAsTable

Optimizing Partition Strategies in Apache Iceberg on AWS

24 Jul, 2025

When working with large-scale analytical datasets, efficient partitioning is critical for achieving optimal query performance and cost savings. Apache Iceberg, a modern table format designed for big

How Transactions Work in Databricks Using Delta Lake

22 Jul, 2025

Databricks is a powerful platform for big data analytics and machine learning. One of its key features is the ability to run transactional workloads over large-scale data lakes using Delta Lake. This

When Should You Use Iceberg with Athena? Partitioning Strategies and Best Practices

5 Jul, 2025

As data lakes grow in size and complexity, tools like Amazon Athena combined with table formats like Apache Iceberg become essential for scalability, data governance, and performance. In this post,

When Should You Use Parquet and When Should You Use Iceberg?

18 Jun, 2025

In modern data architectures, selecting the right storage and management solution is essential for building efficient, reliable, and scalable pipelines. Two popular choices that often come up are

How to Fix 'DataFrame' object has no attribute 'writeTo' When Working with Apache Iceberg in PySpark

17 Jun, 2025

If you’re working with Apache Iceberg in PySpark and encounter this error: "Failed to write to Iceberg table: 'DataFrame' object has no attribute 'writeTo'", you’re not alone. This is a common mistake
