By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: How Dropbox Built a Scalable Context Engine for Enterprise Knowledge Search
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > News > How Dropbox Built a Scalable Context Engine for Enterprise Knowledge Search
News

How Dropbox Built a Scalable Context Engine for Enterprise Knowledge Search

News Room
Last updated: 2026/02/18 at 3:27 AM
News Room Published 18 February 2026
Share
How Dropbox Built a Scalable Context Engine for Enterprise Knowledge Search
SHARE

Dropbox engineers have detailed how the organization was able to build the context engine behind Dropbox Dash, demonstrating a shift towards index-based retrieval, knowledge graph-derived context, and continuous evaluation to support enterprise AI knowledge retrieval at scale. The design points to a broader pattern emerging across enterprise assistants, whereby teams are deliberately constraining their live tool usage and instead relying more heavily on pre-processed, permission-aware context to speed latency, improve quality and ease token pressure.

As part of a recent engineering talk, Dropbox VP of Engineering, Josh Clemm described their application as a response to work in enterprises being distributed across dozens of SaaS applications, each with their own distinct APIs, permission structures and rate limits. Despite the latest language models incorporating reasoning, Clemm said they lack direct access to an enterprise’s data for context. This leads to additional infrastructure being necessary to retrieve potentially sensitive information safely.

The architecture at the center of Dash relies on pre-processing content rather than runtime inference retrieval. Data from the connected knowledge applications is normalized, enriched and indexed before a query is made using a mix of lexical search and dense vectors. This allows the application to return results without having to create a spiderweb of API calls at query-time.

This method does incur higher complexity and storage costs, but Dropbox felt that the investment was worthwhile given the benefits of offline ranking experiments, improved relevance signals and predictable query-time performance.

One of the main components of the Dash application is the use of knowledge graphs to create models of relationships across common business-centric media (people, documents, meetings etc). However, rather than querying a graph database at runtime, “knowledge bundles” are derived and fed into the aforementioned indexing pipeline. Clemm states that earlier experiments with graph databases led to latency and query-pattern changes, this resulted in the team treating graph information as part of the context enrichment rather than another layer to query.

The team also described the challenges associated with exposing multiple tools directly to language models through the MCP (Model Context Protocol). Citing context window consumption, Dropbox observed degraded agent performance and slow queries when many of the tools were used asynchronously. To get around this, the team consolidated retrieval behind a small number of high-level tools. These tools were able to retrieve context outside of the prompt and route more complex requests to agents with narrower scopes.

The MCP’s creators have also expressed their concerns about context window consumption when using multiple tools. They state that each addition requires careful managment.

Outside of retrieval, Dropbox touched on the importance of label evaluation at scale. Since the results from queries are consumed by language models rather than humans, traditional click-based relevance signals do not work. Dropbox was able to use language models as judges to measure and score retrieval quality. This allowed them to refine their prompts and ranking logic, thereby reducing labelling disagreements with their human users.

The Dash team was able to operationalize the evaluation process using DSPy, a framework for prompt optimization. Clemm states that DSPy was able to manage more than 30 prompts across all workflows. This allowed faster model switching without manually rewriting each model’s prompts.

The approach taken by the Dash team, closely resembles patterns seen in other enterprise knowledge assistants. Microsoft 365 Copilot also relies on a pre-computed semantic index derived from the Microsoft Graph to retrieve context efficiently.

Together, these designs point to a growing signal around treating context as a first-class system in enterprise AI, rather than something assembled on the fly at inference time. As teams scale their internal search and agent capabilities within large organizations, architectures that pre-compute, constrain, and continuously evaluate context appear to be becoming a more common foundation.

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Tencent and 2K team up to launch mobile game NBA 2K All Star on March 25 · TechNode Tencent and 2K team up to launch mobile game NBA 2K All Star on March 25 · TechNode
Next Article AlphaTON Capital Deploys 504 NVIDIA Blackwell B200 GPUs to Power Telegram’s AI Infrastructure | HackerNoon AlphaTON Capital Deploys 504 NVIDIA Blackwell B200 GPUs to Power Telegram’s AI Infrastructure | HackerNoon
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

How to Get Followers on Threads in 2025 (+ Brand Inspo)
How to Get Followers on Threads in 2025 (+ Brand Inspo)
Computing
Samsung and Apple’s “wide” Folds prove that the Pixel Fold was visionary
Samsung and Apple’s “wide” Folds prove that the Pixel Fold was visionary
News
The Ghost in the Warehouse: How to Solve Schema Drift in Analytical AI Agents | HackerNoon
The Ghost in the Warehouse: How to Solve Schema Drift in Analytical AI Agents | HackerNoon
Computing
Today's NYT Wordle Hints, Answer and Help for Feb. 18 #1705 – CNET
Today's NYT Wordle Hints, Answer and Help for Feb. 18 #1705 – CNET
News

You Might also Like

Samsung and Apple’s “wide” Folds prove that the Pixel Fold was visionary
News

Samsung and Apple’s “wide” Folds prove that the Pixel Fold was visionary

13 Min Read
Today's NYT Wordle Hints, Answer and Help for Feb. 18 #1705 – CNET
News

Today's NYT Wordle Hints, Answer and Help for Feb. 18 #1705 – CNET

2 Min Read
U.S. court bars OpenAI from using ‘Cameo’ |  News
News

U.S. court bars OpenAI from using ‘Cameo’ | News

3 Min Read
Fyld eyes US expansion as AI worksite safety group raises £32m – UKTN
News

Fyld eyes US expansion as AI worksite safety group raises £32m – UKTN

2 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?