Google Cloud Announces Rapid Storage For Millisecond-Latency Workloads

Google Cloud Announces Rapid Storage for Millisecond-Latency Workloads

Last updated: 2025/05/10 at 2:21 AM

News Room Published 10 May 2025

At the recent Google Cloud Next 2025, the cloud provider announced Rapid Storage, a new Cloud Storage zonal bucket designed to deliver consistent single-digit millisecond data access for frequently accessed data and latency-sensitive applications. The new storage class provides under 1ms random read and write latency, 20x faster data access, and 6 TB/s of throughput.

Unlike traditional Cloud Storage buckets that span multiple zones in a region, a Rapid Storage zonal bucket keeps data within the same zone where the GPUs and TPUs are, helping workloads to achieve higher throughput and sub-millisecond read/write latencies. Sameet Agarwal, VP/GM at Google, and Asad Khan, senior director PM at Google, explain the goal:

To train, checkpoint, and serve AI models at peak efficiency, you need to keep your GPU or TPUs saturated with data to minimize wasted compute (…) Traditional object storage suffers from a critical limitation: latency. Using Google’s Colossus cluster-level file system, we are delivering a new approach to colocate storage and AI accelerators in a new zonal bucket.

A Rapid Storage bucket can be mounted as a file system using Cloud Storage FUSE, allowing support for common AI frameworks such as TensorFlow and PyTorch. In the article “Colossus: the secret ingredient in Rapid Storage’s high performance,” Denis Serenyi, distinguished software engineer at Google, and Vivek Saraswat, group product manager at Google, explain how the cloud provider added a stateful gRPC-based streaming protocol and the ability to append data to an object, while maintaining the high aggregate throughput and scale of object storage.

In a popular Reddit thread, the community compared the new option with Amazon S3 Express. Akshay Shah, CTO at Buf, comments:

Very cool! This makes Google the only major cloud that has low-latency single-zone object storage, standard regional object storage, and transparently-replicated dual-region object storage – all with the same API.

The same week that Google Cloud announced the new storage class, AWS announced that S3 Express One Zone has reduced storage prices by 31%, PUT request prices by 55%, and GET request prices by 85%. Agarwal and Khan claim:

Rapid Storage delivers 5x lower latency for random reads and writes compared to other leading hyperscalers. Combined with throughput of up to 6 TB/s per bucket and up to 20 million queries per second (QPS)

Rapid Storage was not the only storage announcement made at Cloud Next 2025 targeting AI workloads: Anywhere Cache is an SSD cache that works with existing regional buckets to cache data within a selected zone, while Google Cloud Managed Lustre is a high-performance parallel file system built on the DDN EXAScaler Lustre file system, providing PB scale at under 1ms latency, millions of IOPS, and TB/s of throughput for AI workloads.

Rapid Storage is currently in preview.

Google Cloud Announces Rapid Storage for Millisecond-Latency Workloads

Leave a Reply Cancel reply

Stay Connected

Latest News

Samsung may be killing bootloader unlocking on all phones with One UI 8

DOGE has built an AI tool to slash federal regulations | News

Hilarious moment Donald Trump’s caddie appears to drop golf ball in prime spot

How to Beat Commander Honglan in Wuchang Fallen Feathers

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

Topics

Sign Up for Our Newsletter

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.

Leave a Reply Cancel reply

Stay Connected

Latest News