By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: Amazon S3 Adds Sort and Z-Order Compaction to Improve Apache Iceberg Query Performance
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > News > Amazon S3 Adds Sort and Z-Order Compaction to Improve Apache Iceberg Query Performance
News

Amazon S3 Adds Sort and Z-Order Compaction to Improve Apache Iceberg Query Performance

News Room
Last updated: 2025/07/16 at 1:57 PM
News Room Published 16 July 2025
Share
SHARE

AWS has recently announced that Amazon S3 now supports sort and z-order compaction for Apache Iceberg tables. The new features reduce scan times and engine costs, and are available for both S3 Tables and traditional S3 buckets using AWS Glue Data Catalog optimization.

Sort compaction minimizes the number of data files scanned by query engines, while z-order compaction provides additional performance benefits through efficient file pruning when querying across multiple columns simultaneously. Sébastien Stormacq, principal developer advocate at AWS, explains:

When working with high-ingest or frequently updated datasets, data lakes can accumulate many small files that impact the cost and performance of your queries. (…) Although the default binpack strategy with managed compaction provides notable performance improvements, introducing sort and z-order compaction options for both S3 and S3 Tables delivers even greater gains for queries filtering across one or more dimensions.

Sort compaction organizes files based on a user-defined column order. When tables have a defined sort order, S3 Tables compaction will now use sort to cluster similar values together during the compaction process.

In Apache Iceberg, compaction can be used to combine small files into larger files (bin packing), merge delete files with data files, sort the data in accordance with query patterns or cluster the data by using space-filling curves to optimize for distinct query patterns (z-order sorting).

S3 Tables provide a managed experience with automatic hierarchical sorting during compaction, based on defined table metadata. For equal prioritization of multiple query predicates, z-order compaction can be enabled via the maintenance API. For Iceberg tables in general-purpose S3 buckets, the compaction method can be configured in the Glue Data Catalog console. Stormacq adds:

In my experience, depending on my data layout and query patterns, I observed performance improvements of threefold or more when switching from binpack to sort or z-order.

Ruben Simon, product manager at BMW, comments:

At BMW’s largest big data analytics platform, using thousands of S3 buckets and Iceberg tables, we saw major query performance gains with Z-ordering. (…) Bloom filters next would make it even more powerful.

In the article “S3 Managed Tables, Unmanaged Costs: The 20x Surprise with AWS S3 Tables”, Vinish Reddy Pannala, software engineer at Onehouse.ai, and Kyle Weller, VP of Product at Onehouse.ai, question the lack of configurable options for compaction:

Roughly 3 hours after the table was created, S3 Tables finally triggered compaction executing 10 replace operations and compacting approximately 100 GB of data over the course of 1 hour. (…) This exposes a deeper flaw in the S3Tables approach, where it does not recognize that ideal compaction configurations are specific to different types of readers and writers.

Existing compacted files will remain unchanged, and only new data written after enabling sort or z-order will be affected, unless the customer explicitly rewrites data using standard Iceberg tools or by increasing the target file size in the table maintenance settings. Yonatan Dolan, principal analytics specialist at AWS, warns:

Everyone talks about Sort, Z-order, and BinPack compaction when tuning query performance in Apache Iceberg – and yes, sorting helps (when done right), and Z-order can outperform bin-packing on the right queries. But in my benchmarks using TPC-H SF100 lineitem (~600M rows / 17GB compressed), I found something even more influential: The starting size of your files before compaction can massively impact cost.

Source: Yonatan Dolan’s post

The new compaction options are available in all regions where S3 Tables are supported and, for standard S3 buckets, where integration with Glue Data Catalog is available. There are no specific costs associated with the new features.

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article How AI-Powered Chatbots Are Shaping the Future of Customer Engagement: The Openxcell Perspective | HackerNoon
Next Article Hackers Leverage Microsoft Teams to Spread Matanbuchus 3.0 Malware to Targeted Firms
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

Warning to Brits over explosion of WASPS as population soars
News
In Sparse Clouds and Ambiguous Texts, This AI Model Still Finds Its Way | HackerNoon
Computing
Today's NYT Connections: Sports Edition Hints, Answers for July 17 #297
News
How Do You Teach I/O Before Monads? Inside a Radical Syllabus for a 1000-Student FP Course | HackerNoon
Computing

You Might also Like

News

Warning to Brits over explosion of WASPS as population soars

4 Min Read
News

Today's NYT Connections: Sports Edition Hints, Answers for July 17 #297

3 Min Read
News

Amid Starlink’s Expansion, FCC Moves to Speed Up Satellite Approvals

6 Min Read
News

Certain Chinese made iPhones face a ban in the United States

4 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?