By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: Controlling your screen comes to Gemini 3.5 Flash
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > Computing > Controlling your screen comes to Gemini 3.5 Flash
Computing

Controlling your screen comes to Gemini 3.5 Flash

News Room
Last updated: 2026/06/25 at 10:30 PM
News Room Published 25 June 2026
Share
Controlling your screen comes to Gemini 3.5 Flash
SHARE

It is now a native integration for Computer Use in a Gemini Flash model, and in this case Gemini 3.5 Flash. This capability was previously only available through a dedicated model. With native integration, developers have a unified tool to create sophisticated agents.

With Computer Use in Gemini 3.5 Flash

With this update, Gemini 3.5 Flash can analyze the screen, understand visual context, and generate concrete actions like mouse clicks or keyboard inputs.

The model can thus navigate websites or fill out forms independently, deciding on the best actions to take. More generally, it is about seeing, reasoning andact in web, desktop and mobile browsing environments.

On the OSWorld benchmark, which evaluates such skills, Gemini 3.5 Flash achieves a score of 78.4 and approaches the leaders in the field for interaction tasks.

How does this interaction work?

The process is based on a continuous interaction loop. The AI ​​agent analyzes a screenshot of the GUI, whether it’s a browser, desktop app, or mobile app. The developer’s application must then perform these actions, capture the new screen state, and send it back to Gemini.

Based on the given goal, the model determines the next action to perform (click, scroll, etc.) and returns it for execution. This cycle continues until the task is completed, allowing automation without human intervention or a specific API.

Entrusting so much power to an AI?

Google subjected the model to adversarial training (adversarial training) in order to protect it against prompt injections, an attack technique aimed at diverting the AI ​​from its initial objective.

Two optional safeguards are offered to companies: the need for explicit confirmation from the user for sensitive or irreversible actions, and automatic termination of the task in the event of detection of an indirect attack.

It is possible to preview the capabilities of Computer Use in Gemini 3.5 Flash with a demo environment on Browserbase.

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Xbox: Microsoft increases console prices worldwide on August 1st Xbox: Microsoft increases console prices worldwide on August 1st
Next Article Every year a veterinarian, a jurist, a psychoanalyst and a gardener meet. They are the secret owners of rearmament in Europe Every year a veterinarian, a jurist, a psychoanalyst and a gardener meet. They are the secret owners of rearmament in Europe
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

From Donna to Britta: How a coach organizes her business with over 20 AI agents
From Donna to Britta: How a coach organizes her business with over 20 AI agents
Gadget
Using Visual Studio Code with local LLM – here’s how
Using Visual Studio Code with local LLM – here’s how
News
New in .NET 10.0 (29): Check IP addresses with IPAddress.IsValid()
New in .NET 10.0 (29): Check IP addresses with IPAddress.IsValid()
Software
Every year a veterinarian, a jurist, a psychoanalyst and a gardener meet. They are the secret owners of rearmament in Europe
Every year a veterinarian, a jurist, a psychoanalyst and a gardener meet. They are the secret owners of rearmament in Europe
Gaming

You Might also Like

Tesla launches global competition to offer free Supercharging for life
Computing

Tesla launches global competition to offer free Supercharging for life

4 Min Read
The PACT protocol will replace CAPTCHAs with the help of web giants
Computing

The PACT protocol will replace CAPTCHAs with the help of web giants

4 Min Read
the driver blames Autopilot, the data says otherwise
Computing

the driver blames Autopilot, the data says otherwise

5 Min Read
Alibaba attacks US over military blacklist
Computing

Alibaba attacks US over military blacklist

3 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?