World of Software
Copyright © All Rights Reserved. World of Software.

AI agents are broken. Is GPT-5 really the answer?

News Room
Last updated: 2025/08/11 at 2:49 AM
News Room Published 11 August 2025

As 2025 dawned, OpenAI CEO Sam Altman was promoting two developments he insisted would transform our lives. One, of course, was GPT-5 — a long-anticipated major upgrade to the Large Language Model (LLM) that powered ChatGPT’s rise to tech world superstardom.

The other? AI Agents that don’t just answer your queries like ChatGPT, but actually get stuff done for you. “We believe that, in 2025, we may see the first AI agents join the workforce and materially change the output of companies,” Altman wrote back in January.

Well, we’re eight months in, and Altman’s prediction already needs a big old asterisk. Sure, companies are keen to adopt AI Agents, such as OpenAI’s ChatGPT agent. In a May 2025 report, consultancy giant PwC found that half of all firms surveyed planned to implement some kind of AI Agent by the end of the year. Some 88% of executives want to increase their teams’ AI budgets because of Agentic AI.

But what about the actual AI Agent experience? With apologies to all those hopeful executives, the reviews are almost uniformly negative.

If “AI Agents” were a new high-tech James Bond movie, here are the kinds of blurbs you’d see on Rotten Tomatoes: “glitchy … inconsistent” (Wired); “came off like a clueless internet newbie” (Fast Company); “reality doesn’t live up to the hype” (Fortune); “not matching up to the buzzwords” (Bloomberg); “the new vaporware … overpromising is worse than ever” (Forbes).

Study finds OpenAI’s entry failed nearly every time

A May 2025 Carnegie Mellon University study (PDF) found Google’s Gemini 2.5 Pro failed at real-world office tasks 70% of the time. And that was the best-performing agent. OpenAI’s entry, powered by GPT-4o, failed more than 90% of the time.

GPT-5 is likely to improve on that number … but that’s not saying much. And not just because early reports say OpenAI struggled to fill GPT-5 with enough improvements to make it worthy of the release number.

Indeed, it’s starting to look to researchers as if this disappointment is baked into the whole process of LLMs learning to do stuff for you. The problem, as one AI Agent engineer’s analysis makes clear, is simple math: errors compound over time, so the more steps a task involves, the less likely an agent is to finish it correctly. AI Agents that handle multiple complex tasks are also prone to hallucination, like all AI.
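To put rough numbers on that compounding argument, here is a minimal Python sketch. It rests on a simplifying assumption that is not from the study or the engineer's analysis: every step succeeds independently with the same probability. The function names and the 95% figure are hypothetical, chosen purely for illustration.

```python
def chain_success(per_step_success: float, steps: int) -> float:
    """Chance that every one of `steps` steps succeeds.

    Assumes (for illustration only) that steps are independent and
    equally reliable, so the probabilities simply multiply.
    """
    if not 0.0 <= per_step_success <= 1.0:
        raise ValueError("per_step_success must be between 0 and 1")
    if steps < 0:
        raise ValueError("steps must be non-negative")
    return per_step_success ** steps


def max_steps_for(per_step_success: float, target: float) -> int:
    """Longest chain whose overall success rate stays at or above `target`."""
    if not 0.0 < per_step_success < 1.0:
        raise ValueError("per_step_success must be strictly between 0 and 1")
    if not 0.0 < target <= 1.0:
        raise ValueError("target must be in (0, 1]")
    steps = 0
    overall = 1.0
    # Keep adding steps while one more step would still meet the target.
    while overall * per_step_success >= target:
        overall *= per_step_success
        steps += 1
    return steps


if __name__ == "__main__":
    # Hypothetical agent that gets each individual step right 95% of the time.
    for n in (1, 5, 10, 20, 50):
        print(f"{n:>2} steps: {chain_success(0.95, n):.1%} chance of a clean run")
    print("Steps before a coin flip beats it:", max_steps_for(0.95, 0.5))
```

Under these assumptions, an agent that is 95% reliable per step finishes a 20-step task cleanly only about a third of the time, and drops below even odds after 13 steps. Real agents are messier than this model, since some errors are recoverable and steps are rarely independent, but the direction of the effect is the same.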

In the end some agents “panic” and can make “a catastrophic error in judgment,” to quote an apology from a Replit AI Agent that literally deleted a customer’s database after 9 days of working on a coding task. (Replit’s CEO called the failure “unacceptable”.)

Tellingly, that isn’t the only AI-Agent-wipes-code story of 2025, which explains why one enterprising startup is offering insurance against your AI Agent going haywire, and why Walmart has had to bring in four “super Agents” in a bid to corral its AI Agents.

No wonder a recent Gartner paper predicted that 40% of all those AI Agents currently being initiated by companies will be canceled within 2 years. “Most Agentic AI projects,” wrote senior analyst Anushree Verma, are “driven by hype and misapplied … This can blind organizations to the real cost and complexity of deploying AI agents at scale.”

What can GPT-5 do for AI Agents?

It’s possible that ChatGPT agent will vault to the top of the reliability charts once it’s powered by GPT-5. (Again, that’s not the highest of barriers.) But the new release is unlikely to fix what really ails the Agentic world.

That’s because guardrails are already being erected — by companies as well as regulators — shutting down what even the most reliable AI Agent can do for you.

Take Amazon, for example. The world’s largest retailer, like most tech giants, is talking a big game on AI Agents (as it did at a Shanghai Agentic AI fair in July, pictured above). At the same time, Amazon has shut down the ability of any AI Agent to browse and buy anywhere on its site.

That makes sense for Amazon, which has always wanted control over the customer experience, not to mention its desire to deliver ads and sponsored results to actual human eyeballs. But it’s also curtailing a massive amount of potential Agent activity right there. (On the plus side, no “catastrophic failure” involving a large pile of next-day deliveries at your door.)

And do we trust AI Agents to buy online for us anyway? It’s not that they’re evil and want to steal your credit card data; it’s that they’re naive and vulnerable to being phished by bad actors who do want your card.

Even GPT-5 may not be able to get around one vulnerability seen by researchers: data embedded in images can instruct AI agents to reveal any credit card info they might have, with the user being none the wiser.

If that kind of problem is exploited on a corporate scale, then Altman may be right about AI Agents “materially changing output” — just not in the way he meant.


Disclosure: Ziff Davis, Mashable’s parent company, in April filed a lawsuit against OpenAI, alleging it infringed Ziff Davis copyrights in training and operating its AI systems.

Topics
Artificial Intelligence
OpenAI
