By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: Google DeepMind Launches Gemini 2.5 Computer Use Model to Power UI-Controlling AI Agents
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > News > Google DeepMind Launches Gemini 2.5 Computer Use Model to Power UI-Controlling AI Agents
News

Google DeepMind Launches Gemini 2.5 Computer Use Model to Power UI-Controlling AI Agents

News Room
Last updated: 2025/10/09 at 5:21 PM
News Room Published 9 October 2025
Share
SHARE

Google DeepMind has recently released the Gemini 2.5 Computer Use model, a specialized variant of its Gemini 2.5 Pro system designed to enable AI agents to interact directly with graphical user interfaces. The new model allows developers to build agents that can click, type, scroll, and manipulate interactive elements on web pages.

The Computer Use model brings Gemini’s multimodal reasoning and visual understanding to environments like browsers and mobile apps, where AI must perceive the on-screen context and act accordingly. Early evaluations show the model performing strongly on several interface control benchmarks, including Online-Mind2Web, WebVoyager, and AndroidWorld. In tests reported by DeepMind and Browserbase, it reached around 70% accuracy on the Online-Mind2Web benchmark, with response times shorter than those of other publicly evaluated systems.

In practical terms, the model operates in a loop via a new computer_use tool exposed through the Gemini API. Developers provide the model with a screenshot of the environment, a task description, and a record of previous actions. The model then returns structured function calls representing actions such as “click,” “type,” or “scroll.” The client executes these actions, captures a new screenshot, and feeds it back to the model — repeating the cycle until the task is complete.

While currently optimized for browser environments, the Computer Use model also shows strong promise for mobile UI control, signaling potential expansion to desktop operating systems in the future.

The launch has sparked critical discussion among developers. Wissam Benhaddad, a senior data science consultant, noted that while the approach is promising, practical deployment remains challenging:

This type of solution is promising, but I do not think it is production-ready yet. Current implementations are extremely slow and can often be replaced by standard API calls or direct app integrations. In my view, reasoning should not happen at the LLM level but rather within a latent space where information can move in a more compressed and efficient way — which is what Deep Learning excels at. I hope to see this kind of product evolve in that direction.

DeepMind emphasizes that safety guardrails are central to the system’s design. The Gemini 2.5 Computer Use model integrates protections against malicious prompts, unsafe actions, and scams within web environments. Each model action is assessed through a per-step safety service before execution, and developers can require user confirmation for sensitive operations such as purchases or system-level interactions.

The model’s system card outlines how these safety features mitigate potential risks while allowing developers to maintain full oversight. DeepMind advises thorough testing before deploying agents to production.

Gemini 2.5 Computer Use is available now in preview via the Gemini API in Google AI Studio and Vertex AI.

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article SuiteWorld 2025: Oracle NetSuite Next bets everything on AI
Next Article NewDays raises additional $4.5M for platform that uses generative AI to treat people with dementia
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

4 devices I refuse to use without an Ethernet connection
News
Embracing the Uncertainty of Chaos-Driven Testing: Integration Tests That Can Destroy and Rebuild | HackerNoon
Computing
You can now grab the Pixel 10 Pro Fold for just $799 with this massive T-Mobile deal
News
Huawei’s 2023 global sales revenue hits nearly 98 billion dollars, up by 9.63% y-o-y · TechNode
Computing

You Might also Like

News

4 devices I refuse to use without an Ethernet connection

7 Min Read
News

You can now grab the Pixel 10 Pro Fold for just $799 with this massive T-Mobile deal

3 Min Read
News

Today's NYT Connections Hints, Answers for Oct. 10 #852

3 Min Read
News

iOS 26 Liquid Glass Design Copied by Android Smartphone Maker

7 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?