World of Software
Copyright © All Rights Reserved. World of Software.

OpenAI Releases Operator, an AI Agent for Web-Based Tasks

News Room
Last updated: 2025/02/18 at 9:26 AM
Published 18 February 2025

OpenAI released a research preview of Operator, an AI agent that can use a web browser to perform tasks on a user’s behalf. Operator achieves new state-of-the-art performance on the WebArena and WebVoyager benchmarks.

To build Operator, OpenAI developed a new model called Computer-Using Agent (CUA), which is derived from GPT-4o. It relies on GPT-4o’s vision ability to understand the contents of a browser screen, and it is further trained to interact with GUI elements like buttons and menus. To perform a task, it repeats a cycle of perception, reasoning, and action steps until the task is complete. OpenAI has built in several safety guardrails: for example, Operator requires the user to take over when entering passwords, and it refuses some high-risk tasks such as banking transactions. According to OpenAI:

We have made significant progress in deep reasoning through the o-model series, vision capabilities through GPT-4o, and new techniques to improve robustness through reinforcement learning and instruction hierarchy. The next challenge space we plan to explore is expanding the action space of agents. The flexibility offered by a universal interface addresses this challenge, enabling an agent that can navigate any software tool designed for humans. By moving beyond specialized agent-friendly APIs, CUA can adapt to whatever computer environment is available—truly addressing the “long tail” of digital use cases that remain out of reach for most AI.
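
The perception, reasoning, and action cycle described earlier, together with the password hand-off guardrail, can be illustrated with a toy sketch. This is not OpenAI's implementation or API: the `Action` type, `run_agent`, `FakeBrowser`, and `toy_policy` are hypothetical names invented for illustration, and a hard-coded rule stands in for the model's reasoning over screenshots.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Action:
    """Hypothetical action record; `kind` is one of
    "type", "click", "done", "handoff", or "refuse"."""
    kind: str
    detail: str = ""


# Action kinds that end the loop: task finished, user takeover, or refusal.
TERMINAL_KINDS = {"done", "handoff", "refuse"}


def run_agent(
    task: str,
    perceive: Callable[[], str],
    decide: Callable[[str, str, List[Action]], Action],
    act: Callable[[Action], None],
    max_steps: int = 20,
) -> List[Action]:
    """Repeat perceive -> decide -> act until a terminal action or max_steps."""
    history: List[Action] = []
    for _ in range(max_steps):
        screen = perceive()                     # perception: observe the screen
        action = decide(task, screen, history)  # reasoning: choose the next step
        history.append(action)
        if action.kind in TERMINAL_KINDS:
            break                               # stop without acting
        act(action)                             # action: apply it to the browser
    return history


class FakeBrowser:
    """Stand-in for a real browser; the 'screenshot' is just a text label."""

    def __init__(self) -> None:
        self.screen = "search page"

    def screenshot(self) -> str:
        return self.screen

    def apply(self, action: Action) -> None:
        if action.kind == "type":
            self.screen = "results page"
        elif action.kind == "click":
            self.screen = "login page with password field"


def toy_policy(task: str, screen: str, history: List[Action]) -> Action:
    """Hard-coded stand-in for the model (assumption, not CUA's logic)."""
    if "password" in screen:
        # Guardrail from the article: the user takes over for passwords.
        return Action("handoff", "user must enter the password")
    if "results" in screen:
        return Action("click", "first result")
    return Action("type", task)


browser = FakeBrowser()
trace = run_agent("book a table", browser.screenshot, toy_policy, browser.apply)
print([a.kind for a in trace])  # ['type', 'click', 'handoff']
```

In this sketch the loop ends as soon as the policy returns a hand-off, so the agent never acts on the password screen. The `max_steps` cap is an added assumption that keeps a misbehaving policy from looping forever.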

In late 2024, InfoQ covered Anthropic’s release of the Computer Use feature, which allows their Claude model to interact with a computer by interpreting the images on the screen, moving the mouse pointer, clicking buttons, and entering text via a virtual keyboard. Claude set records on several OS and web use benchmarks, but Operator outperforms it on WebArena, WebVoyager, and OSWorld. However, Operator still falls short of human performance on these tasks: for example, it scores 38.1% on OSWorld vs. over 70% for humans.

CUA Benchmark Scores. Image Source: OpenAI’s CUA Report

Because Operator can take actions on websites, OpenAI added several safety measures beyond those already built into GPT-4o. Particularly important are the safeguards against adversarial attacks by malicious websites, including prompt injection and phishing. OpenAI used red teams to test the safeguards and claims that its mitigation against prompt injection worked in “all but one case.”

AI researcher and entrepreneur Andrej Karpathy wrote about Operator on X:

Projects like OpenAI’s Operator are to the digital world as humanoid robots are to the physical world. One general setting (monitor keyboard and mouse, or human body) that can in principle gradually perform arbitrarily general tasks, via an I/O interface originally designed for humans. In both cases, it leads to a gradually mixed-autonomy world, where humans become high-level supervisors of low-level automation. A bit like a driver monitoring the Autopilot. This will happen faster in the digital world than in the physical world because flipping bits is somewhere around 1000X less expensive than moving atoms. Though the market size and opportunity feels a lot bigger in the physical world.

Operator is available only via the web, and only to ChatGPT Pro users. OpenAI intends to expand this to other paid ChatGPT plans “once we are confident in its safety and usability at scale,” and to make the underlying CUA model available via API.
