By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: Airbnb Executes Istio Upgrades at Massive Scale
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > News > Airbnb Executes Istio Upgrades at Massive Scale
News

Airbnb Executes Istio Upgrades at Massive Scale

News Room
Last updated: 2025/08/29 at 8:04 AM
News Room Published 29 August 2025
Share
SHARE

Airbnb engineering has published a detailed account of how it maintains high availability during Istio upgrades across tens of thousands of pods and thousands of VMs, all without downtime. The company’s service mesh infrastructure supports workloads in both Kubernetes and VM environments, handling tens of millions of queries per second at peak. Despite the complexity, Airbnb has completed Istio upgrades 14 times to date.

The key challenge lies in coordinating upgrades across diverse workloads owned by different teams. To address this, Airbnb designed an upgrade pipeline that “guarantees” zero downtime, enables gradual rollouts, supports failback, and ensures all workloads are updated within a fixed timeframe.

Technically, the process relies on a canary-style dual-version deployment of Istio control planes, each distinguished by a revision label (e.g., 1-24-5, 1-25-2). Workloads are pinned to specific revisions via the mutating webhook, which injects the appropriate istio-proxy sidecar. Upgrading artfully transitions select workloads to the new version based on distribution rules defined in a rollouts.yml file.

To eliminate manual label updates across numerous teams, Airbnb leverages Krispr, an internal mutation framework. During CI, Krispr injects the correct revision label into workload specs based on the rollout configuration. It also continuously migrates older pods via admission-time mutation, ensuring that within four weeks, all workloads transition smoothly, even inactive ones.

For VM workloads, Airbnb uses mxagent, a daemon that polls version tags on each host and atomically upgrades both the istio-proxy and its configuration when necessary. A central controller (mxrc) coordinates VM rollouts, respecting health checks and upgrade safety thresholds similar to Kubernetes’ maxUnavailable semantics.

Alongside Airbnb’s recent successful service mesh upgrades, other companies have approached the idea of service mesh upgrades a little differently:

Netflix has introduced its own zero-config service mesh. Instead of relying on a heavy control plane model, Netflix designed a mesh that automatically manages service discovery, retries, and traffic routing without requiring manual configuration. In doing so, Netflix sidesteps the coordination challenges of multi-version Istio upgrades, while still gaining the traffic management and reliability benefits that a service mesh provides.

LinkedIn, which runs one of the largest Kubernetes deployments, uses a mix of canary deployments and traffic mirroring for upgrades to core infrastructure, including Kafka and networking layers. For its service-to-service communication stack, LinkedIn has experimented with Envoy-based solutions but leans on gradual rollout pipelines with mirrored traffic for safety. This approach is conceptually similar to Airbnb’s dual Istio revisions: both allow traffic to be validated against new versions before flipping fully.

As one of the creators of Istio, Google Cloud itself has pioneered multi-revision control planes for customers, similar to Airbnb’s implementation. GKE now allows operators to run multiple Istio versions side by side, easing rollouts and failback. Google is also pushing Ambient Mode, which replaces sidecars with lightweight data-plane proxies, reducing upgrade blast radius significantly. Airbnb has expressed interest in Ambient Mode, signaling alignment with Google’s next-gen mesh direction.

Uber runs an internal mesh framework built on Envoy that integrates closely with its custom service discovery system. Their upgrade strategy often involves progressive deployment by cluster rather than fine-grained revision pinning. Uber has invested in tooling to automate rollback and enforce SLA monitoring during upgrades, somewhat mirroring Airbnb’s mxagent + mxrc setup for VMs.

These comparisons illustrate a broader industry trend: investing in advanced rollout frameworks or mesh innovations to balance complexity, reliability, and operational control.

Looking ahead, Airbnb plans to explore Istio’s Ambient Mode for a more lightweight mesh setup and splitting meshes to limit blast radius and enhance isolation.

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article How to Create Purchase Order Lines in Oracle R12 (with PL/SQL Examples) | HackerNoon
Next Article New ‘Robin Hood’ series with ‘Game of Thrones’ star just got its first trailer and release date — here’s where you can stream it
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

The Internet Revolutionized Porn. Age Verification Could Upend Everything
Gadget
Startup Radar: Hiring and HR tools, dementia care, AI for financial advisors, and smart home lights
Computing
5 secret Kindle app tricks your e-reader can’t do
News
I couldn’t trust anyone after Alien: Earth, and these 6 shows & movies made it even worse
Computing

You Might also Like

News

5 secret Kindle app tricks your e-reader can’t do

9 Min Read
News

This 2nd-gen 4K Google TV streaming stick is now available in more regions

3 Min Read
News

'1984' Macintosh ad writer Steve Hayden dies aged 78

1 Min Read
News

I watched the Dexter revival—here’s how it actually holds up

7 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?