World of Software

Uber and OpenAI Retool Rate Limiting Systems

News Room
Published 17 February 2026
Last updated: 2026/02/17 at 7:01 PM

In recent blog posts, Uber (Uber’s Rate Limiting System) and OpenAI (Beyond rate limits: scaling access to Codex and Sora) each describe a shift in their approach to rate limiting: moving from counter-based, per-service limits to adaptive, policy-based systems. Both companies built proprietary rate-limiting platforms that run at the infrastructure layer. These systems use soft controls that manage traffic by exerting pressure on clients rather than imposing hard stops, through probabilistic shedding at Uber and a credit-based waterfall at OpenAI, with the aim of preserving system resilience without sacrificing user momentum.

Previously, Uber engineers implemented rate limits per service, commonly using token buckets backed by Redis. This caused operational inefficiencies, such as additional latency and the need for deployments just to adjust thresholds. Inconsistent configurations increased maintenance risk and resulted in uneven protection, leaving some smaller services without any limits. Additionally, observability was fragmented, making it difficult to pinpoint problems caused specifically by rate limiting.
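
For readers unfamiliar with the legacy approach, a token bucket can be sketched as follows. This is a minimal in-memory illustration, not Uber's code: the class name `TokenBucket`, its parameters, and the injectable clock are assumptions made for the example, and the Redis backing mentioned above is left out.

```python
import time
from typing import Callable


class TokenBucket:
    """In-memory token bucket (hypothetical names; no Redis backing)."""

    def __init__(self, rate: float, capacity: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.rate = rate          # tokens added per second
        self.capacity = capacity  # maximum burst size
        self.clock = clock        # injectable so the bucket can be tested
        self.tokens = capacity    # start with a full bucket
        self.last = clock()

    def allow(self, cost: float = 1.0) -> bool:
        """Refill based on elapsed time, then try to spend `cost` tokens."""
        now = self.clock()
        elapsed = now - self.last
        self.tokens = min(self.capacity, self.tokens + elapsed * self.rate)
        self.last = now
        if self.tokens >= cost:
            self.tokens -= cost
            return True
        # Hard stop: the caller is rejected until the bucket refills.
        return False
```

Once the bucket is empty, every request is rejected until tokens are replenished, which is the hard-stop behavior the newer systems move away from.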

Uber replaced these legacy limiters with a new Global Rate Limiter (GRL). The GRL architecture consists of a three-tier feedback loop: rate-limit clients in Uber’s service mesh data plane enforce decisions locally, zone aggregators collect metrics, and regional controllers calculate global limits to push back to the clients.
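
The feedback loop can be pictured with a small sketch. The article does not say how the regional controllers turn aggregated metrics into limits, so the proportional rule below, along with the function names `aggregate_zone` and `compute_drop_ratios` and the service names in the usage note, are assumptions chosen only to illustrate the data flow.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple


def aggregate_zone(client_counts: Iterable[Tuple[str, int]]) -> Dict[str, int]:
    """Zone aggregator: sum the per-caller request counts reported by clients."""
    totals: Dict[str, int] = defaultdict(int)
    for caller, count in client_counts:
        totals[caller] += count
    return dict(totals)


def compute_drop_ratios(zone_totals: Iterable[Dict[str, int]],
                        limits: Dict[str, int]) -> Dict[str, float]:
    """Regional controller: combine zones and derive a drop ratio per caller.

    Assumed rule: if a caller's global total exceeds its limit, drop the
    excess fraction; otherwise drop nothing.
    """
    global_totals: Dict[str, int] = defaultdict(int)
    for zone in zone_totals:
        for caller, count in zone.items():
            global_totals[caller] += count

    ratios: Dict[str, float] = {}
    for caller, total in global_totals.items():
        limit = limits.get(caller)
        if limit is None or total <= limit:
            ratios[caller] = 0.0
        else:
            ratios[caller] = 1.0 - limit / total
    return ratios
```

In this sketch, a hypothetical caller `svc-a` sending 1,000 requests against a limit of 500 would get a drop ratio of 0.5, which the controller would push back to the local clients to enforce.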

GRL also replaced hard-stop buckets with a system that drops a configurable percentage of traffic (e.g., 10%). This policy acts as a soft limit that exerts pressure on caller services, allowing them to remain operational rather than being shut down due to exhausted quotas.
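
A percentage-based drop policy of this kind might look like the following sketch. The class name `ProbabilisticShedder` and its interface are hypothetical; only the idea of dropping a configurable fraction of traffic comes from the article.

```python
import random
from typing import Optional


class ProbabilisticShedder:
    """Drops roughly `drop_ratio` of requests at random (hypothetical API)."""

    def __init__(self, drop_ratio: float,
                 rng: Optional[random.Random] = None) -> None:
        if not 0.0 <= drop_ratio <= 1.0:
            raise ValueError("drop_ratio must be between 0 and 1")
        self.drop_ratio = drop_ratio
        self.rng = rng or random.Random()

    def allow(self) -> bool:
        # random() is uniform on [0, 1), so a ratio of 0 never drops
        # and a ratio of 1 always drops.
        return self.rng.random() >= self.drop_ratio
```

With a ratio of 0.1, about one request in ten is rejected while the rest pass, so a caller over its limit degrades gradually instead of being cut off entirely.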

OpenAI implemented its new rate limiter with a similar architecture; however, the primary driver was the user experience of the Codex and Sora applications rather than operational resiliency. With growing adoption, OpenAI saw a consistent pattern: users found significant value in the tools only to be interrupted by rate limits. While these boundaries ensured fair access and system stability, they frequently frustrated engaged users. OpenAI sought a way to maintain momentum without discouraging exploration through immediate usage-based billing.

The engineering team designed a combined approach that allows users to access the system up to a limit, after which the system deducts from a credit balance. The team describes this decision-making process as a “waterfall”:

This model reflects how users actually experience the product. Rate limits, free tiers, credits, promotions, and enterprise entitlements are all just layers in the same decision stack. From a user’s perspective, they don’t “switch systems”—they just keep using Codex and Sora. That’s why credits feel invisible: they’re just another element in the waterfall.

To ensure this transition is seamless, OpenAI built a dedicated real-time access engine that consolidates usage tracking, rate-limit windows, and credit balances into a single evaluation path. Unlike traditional asynchronous billing systems that suffer from lag, this engine makes a provably correct decision synchronously: each request first checks the available capacity in the rate-limit tier and, if that limit is exceeded, immediately checks the credit balance.
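
The waterfall decision can be sketched in a few lines. This is an illustration under stated assumptions rather than OpenAI's engine: the `Account` fields, the `Decision` values, and the `decide` function are hypothetical, only two layers are modeled, and the balance is debited in place for simplicity.

```python
from dataclasses import dataclass
from enum import Enum


class Decision(Enum):
    ALLOW_WITHIN_LIMIT = "within_limit"
    ALLOW_WITH_CREDITS = "credits"
    DENY = "deny"


@dataclass
class Account:
    window_remaining: int  # requests left in the current rate-limit window
    credit_balance: int    # hypothetical credit units


def decide(account: Account, cost_in_credits: int = 1) -> Decision:
    """Evaluate the layers in order: rate-limit tier first, then credits."""
    if account.window_remaining > 0:
        account.window_remaining -= 1
        return Decision.ALLOW_WITHIN_LIMIT
    if account.credit_balance >= cost_in_credits:
        account.credit_balance -= cost_in_credits
        return Decision.ALLOW_WITH_CREDITS
    # Both layers are exhausted, so the request is refused.
    return Decision.DENY
```

Further layers such as promotions or enterprise entitlements would slot in as additional checks in the same ordered sequence.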

To maintain low latency, the system settles credit debits asynchronously through a streaming processor, using stable idempotency keys to prevent double-charging. This architecture relies on three tightly coupled data streams – product usage events, monetization events, and balance updates – ensuring every transaction is auditable and reconcilable without interrupting the user’s creative flow.
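
The role of idempotency keys in settlement can be shown with a small sketch. The `DebitEvent` and `Ledger` types are hypothetical, and an in-memory set stands in for whatever durable store a real streaming processor would use to remember applied keys.

```python
from dataclasses import dataclass, field
from typing import Dict, Set


@dataclass(frozen=True)
class DebitEvent:
    idempotency_key: str  # stable per request, reused on every retry
    account_id: str
    amount: int


@dataclass
class Ledger:
    balances: Dict[str, int]
    applied: Set[str] = field(default_factory=set)

    def settle(self, event: DebitEvent) -> bool:
        """Apply a debit once; redelivered events with the same key are ignored."""
        if event.idempotency_key in self.applied:
            return False
        current = self.balances.get(event.account_id, 0)
        self.balances[event.account_id] = current - event.amount
        self.applied.add(event.idempotency_key)
        return True
```

Because the key stays the same across retries, a stream that redelivers an event leaves the balance unchanged the second time.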

Both Uber and OpenAI report that these architectural shifts have successfully met their respective operational and product goals. At Uber, the implementation of the Global Rate Limiter has scaled to process over 80 million requests per second across 1,100 services, significantly reducing tail latency by removing external Redis dependencies. The system has demonstrated its effectiveness in production by absorbing a 15x traffic surge without degradation and mitigating DDoS attacks before they reached internal systems.

Similarly, OpenAI has integrated its credit system into the access path for Codex and Sora, replacing hard stops with a continuous waterfall model. The platform provides real-time, accurate billing while maintaining the low-latency performance required for interactive AI applications. For both companies, the move to in-house, infrastructure-level platforms has replaced manual configuration with automated, adaptive controls, allowing their respective fleets to handle massive scale with minimal human intervention.
