World of Software

Uber Moves In-House Search Indexing to Pull-Based Ingestion in OpenSearch

News Room
Published 9 February 2026. Last updated: 2026/02/09 at 12:28 PM.

Uber has moved its in-house search indexing system to OpenSearch by introducing a pull-based ingestion framework for large-scale streaming data. The goal was to improve reliability, backpressure handling, and recovery for real-time indexing workloads. The move came after evolving product requirements exposed the growing cost and complexity of maintaining a homegrown search platform, including challenges around schema evolution, relevance tuning, and multi-region consistency.

Uber’s search infrastructure supports ride discovery, delivery selection, and location-based queries, processing continuous event streams in near real time. Its in-house search platform relied on push-based ingestion, where upstream services wrote directly to clusters. While effective at a small scale, this approach struggled with bursts and failures, causing dropped writes and complex retries.

Pull-based ingestion shifts responsibility to the OpenSearch cluster. Shards pull data from durable streams such as Kafka or Kinesis, which act as buffers, allowing controlled rates, internal backpressure, and replay for recovery. Uber engineers report that this approach reduces indexing failures during spikes and simplifies operational recovery. Bursty traffic that previously overwhelmed shard queues is now absorbed by per-shard bounded queues, improving throughput and stability.

Push vs pull ingestion under traffic spikes (Source: Uber Tech Blog)
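
To make the contrast concrete, the following is a toy simulation, not Uber's or OpenSearch's code. It assumes a shard that can index a fixed number of events per tick and has a bounded in-memory queue. The function names, the tick model, and all numbers are hypothetical and chosen only for illustration.

```python
def simulate_push(arrivals, capacity, queue_limit):
    """Push model: producers write straight into the shard's bounded queue.

    Anything that does not fit in the queue at arrival time is dropped.
    Returns (indexed, dropped).
    """
    queue_len = indexed = dropped = 0
    for n in arrivals:
        accepted = min(n, queue_limit - queue_len)
        dropped += n - accepted
        queue_len += accepted
        done = min(queue_len, capacity)  # shard indexes up to `capacity` per tick
        queue_len -= done
        indexed += done
    return indexed, dropped


def simulate_pull(arrivals, capacity, queue_limit):
    """Pull model: events land in a durable log (standing in for Kafka/Kinesis).

    The shard pulls only as many events as its bounded queue can hold,
    so a spike waits in the log instead of being dropped.
    Returns (indexed, still_pending).
    """
    log_backlog = queue_len = indexed = 0
    for n in arrivals:
        log_backlog += n
        pulled = min(log_backlog, queue_limit - queue_len)
        log_backlog -= pulled
        queue_len += pulled
        done = min(queue_len, capacity)
        queue_len -= done
        indexed += done
    return indexed, log_backlog + queue_len


# A spike of 20 events at tick 2, followed by quiet ticks.
arrivals = [2, 2, 20, 2, 2, 0, 0, 0, 0]
push_result = simulate_push(arrivals, capacity=4, queue_limit=6)
pull_result = simulate_pull(arrivals, capacity=4, queue_limit=6)
```

Under these assumed numbers, the push model loses the part of the spike that overflows the queue, while the pull model indexes every event a few ticks later once the backlog drains.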

The pull-based pipeline includes multiple interacting components. Events are produced into Kafka or Kinesis topics, with each shard mapped to a stream partition for deterministic replay. A stream consumer polls messages into a blocking queue, decoupling consumption from processing and enabling parallel writers. Messages are processed by separate threads that validate, transform, and prepare indexing requests before passing them to the ingestion engine. The engine writes directly to Lucene, bypassing the translog, while tracking processed offsets for deterministic recovery.

Pull-based streaming ingestion architecture (Source: Uber Tech Blog)
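
The stages described above can be sketched with Python's standard library. This is a simplified illustration under stated assumptions, not the OpenSearch implementation: a list stands in for one stream partition, a dict stands in for the Lucene index, and `run_pipeline`, the message fields `id` and `body`, and the normalization step are all hypothetical.

```python
import queue
import threading

SENTINEL = None  # marks end of stream for each worker


def run_pipeline(partition, num_workers=2, queue_size=4):
    """Poll one partition into a bounded queue and index it with parallel workers.

    Returns (index, checkpoint), where checkpoint is the highest offset such
    that every offset up to and including it has been processed.
    """
    buffer = queue.Queue(maxsize=queue_size)  # blocking queue between stages
    index = {}         # stand-in for the search index
    processed = set()  # offsets handled so far
    lock = threading.Lock()

    def poller():
        for offset, message in enumerate(partition):
            buffer.put((offset, message))  # blocks while the queue is full
        for _ in range(num_workers):
            buffer.put(SENTINEL)

    def worker():
        while True:
            item = buffer.get()
            if item is SENTINEL:
                return
            offset, message = item
            doc_id, body = message.get("id"), message.get("body")
            with lock:
                # Validate, transform, then "index". Invalid messages are
                # skipped here but still counted as processed.
                if doc_id is not None and body is not None:
                    index[doc_id] = body.strip().lower()
                processed.add(offset)

    threads = [threading.Thread(target=poller)]
    threads += [threading.Thread(target=worker) for _ in range(num_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()

    # Recovery point: the end of the contiguous run of processed offsets.
    checkpoint = -1
    while checkpoint + 1 in processed:
        checkpoint += 1
    return index, checkpoint


partition = [
    {"id": "a", "body": " Pizza "},
    {"id": "b", "body": "Sushi"},
    {"id": "c"},  # malformed: no body
    {"id": "d", "body": "Tacos"},
]
index, checkpoint = run_pipeline(partition)
```

Because the workers run in parallel, this sketch uses distinct document IDs; ordering updates to the same document needs extra handling, such as the external versioning the article describes. After a restart, consumption would resume from `checkpoint + 1`.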

According to Uber engineers, pull-based ingestion also provides fine-grained operational controls. External versioning ensures that out-of-order messages do not overwrite newer updates, which keeps the index consistent under at-least-once processing. Operators can configure failure policies: messages may be discarded under a drop policy or retried indefinitely under a block policy. APIs allow ingestion to be paused, resumed, or reset to specific offsets, helping teams manage backlogs after outages.
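
The following sketch illustrates two of these controls: version checks that reject stale writes, and the difference between drop and block failure policies. It is an assumption-laden toy, not the OpenSearch API: `VersionedIndex`, `FailurePolicy`, `ingest`, and `make_flaky_writer` are hypothetical names, and a real block policy would typically wait between retries rather than loop immediately.

```python
from enum import Enum


class FailurePolicy(Enum):
    DROP = "drop"    # skip a message that fails to index
    BLOCK = "block"  # keep retrying the same message until it succeeds


class VersionedIndex:
    """In-memory stand-in for an index that uses externally supplied versions."""

    def __init__(self):
        self.docs = {}  # doc_id -> (version, body)

    def apply(self, doc_id, version, body):
        """Apply the write only if its version is newer than the stored one."""
        current = self.docs.get(doc_id)
        if current is not None and current[0] >= version:
            return False  # stale or duplicate delivery: ignore
        self.docs[doc_id] = (version, body)
        return True


def ingest(messages, write, policy):
    """Feed (doc_id, version, body) messages to `write`, honoring the policy.

    Returns the offsets that were dropped.
    """
    dropped = []
    for offset, (doc_id, version, body) in enumerate(messages):
        while True:
            try:
                write(doc_id, version, body)
                break
            except RuntimeError:
                if policy is FailurePolicy.DROP:
                    dropped.append(offset)
                    break
                # BLOCK: fall through and retry the same message


    return dropped


def make_flaky_writer(index, failures):
    """Hypothetical writer that raises `failures` times before working."""
    remaining = {"n": failures}

    def write(doc_id, version, body):
        if remaining["n"] > 0:
            remaining["n"] -= 1
            raise RuntimeError("transient indexing failure")
        return index.apply(doc_id, version, body)

    return write


# Out-of-order and duplicate deliveries do not overwrite the newer version.
idx = VersionedIndex()
first = idx.apply("ride-1", 2, "completed")
stale = idx.apply("ride-1", 1, "requested")
duplicate = idx.apply("ride-1", 2, "completed")

messages = [("r1", 1, "a"), ("r2", 1, "b")]

drop_idx = VersionedIndex()
dropped = ingest(messages, make_flaky_writer(drop_idx, 1), FailurePolicy.DROP)

block_idx = VersionedIndex()
blocked = ingest(messages, make_flaky_writer(block_idx, 2), FailurePolicy.BLOCK)
```

The version check is what makes redelivery safe: replaying a message that was already applied leaves the stored document unchanged. The two policies trade completeness for progress, since drop keeps the stream moving at the cost of a missing document, while block preserves every message but stalls behind a failing one.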

Uber supports two ingestion modes. Segment replication ingests data only on primary shards, with replicas fetching completed segments, reducing CPU usage with a slight visibility lag. All-active mode ingests on all shard copies, providing near-instant visibility at a higher compute cost.

Pull-based ingestion is central to Uber’s highly available, multi-region search architecture. Each regional OpenSearch cluster consumes from globally aggregated Kafka topics to build complete, up-to-date indexes. This design provides redundancy, global consistency, and seamless failover, so users worldwide see a consistent search view and the service remains highly available.

Pull-based indexing model (Source: Uber Tech Blog)

Uber is gradually migrating all search use cases to OpenSearch’s pull-based ingestion, moving toward a cloud-native, scalable architecture while continuing to enhance the platform and contribute to the OpenSearch community.
