By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: Anthropic’s new Claude Opus 4 can run autonomously for seven hours straight
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > News > Anthropic’s new Claude Opus 4 can run autonomously for seven hours straight
News

Anthropic’s new Claude Opus 4 can run autonomously for seven hours straight

News Room
Last updated: 2025/05/22 at 8:31 PM
News Room Published 22 May 2025
Share
SHARE

After whirlwind week of announcements from Google and OpenAI, Anthropic has its own news to share.

On Thursday, Anthropic announced Claude Opus 4 and Claude Sonnet 4, its next generation of models, with an emphasis on coding, reasoning, and agentic capabilities. According to Rakuten, which got early access to the model, Claude Opus 4 ran “independently for seven hours with sustained performance.”

Claude Opus is Anthropic’s largest version of the model family with more power for longer, complex tasks, whereas Sonnet is generally speedier and more efficient. Claude Opus 4 is a step up from its previous version, Opus 3, and Sonnet 4 replaces Sonnet 3.7.

Mashable Light Speed

Anthropic says Claude Opus 4 and Sonnet 4 outperform rivals like OpenAI’s o3 and Gemini 2.5 Pro on key benchmarks for agentic coding tasks like SWE-bench and Terminal-bench. It’s worth noting however, that self-reported benchmarks aren’t considered the best markers of performance since these evaluations don’t always translate to real-world use cases, plus AI labs aren’t into the whole transparency thing these days, which AI researchers and policy makers increasingly call for. “AI benchmarks need to be subjected to the same demands concerning transparency, fairness, and explainability, as algorithmic systems and AI models writ large,” said the European Commission’s Joint Research Center.

Opus 4 and Sonnet 4 outperform rivals in SWE-bench, but take benchmark performance with a grain of salt.
Credit: Anthropic

Alongside the launch of Opus 4 and Sonnet 4, Anthropic also introduced new features. That includes web search while Claude is in extended thinking mode, and summaries of Claude’s reasoning log “instead of Claude’s raw thought process.” This is described in the blog post as being more helpful to users, but also “protecting [its] competitive advantage,” i.e. not revealing the ingredients of its secret sauce. Anthropic also announced improved memory and tool use in parallel with other operations, general availability of its agentic coding tool Claude Code, and additional tools for the Claude API.

In the safety and alignment realm, Anthropic said both models are “65 percent less likely to engage in reward hacking than Claude Sonnet 3.7.” Reward hacking is a slightly terrifying phenomenon where models can essentially cheat and lie to earn a reward (successfully perform a task).

One of the best indicators we have in evaluating a model’s performance is users’ own experience with it, although even more subjective than benchmarks. But we’ll soon find out how Claude Opus 4 and Sonnet 4 chalk up to competitors in that regard.

Topics
Artificial Intelligence

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Honor 400 vs Google Pixel 9a: Comparing the mid-range Androids
Next Article Prompt or Perish: The New Rules of Work in the Age of AI and Vibe Coding
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

The best headphones in 2025
Software
Breaking Into Quant Trading: A Practical, No-Fluff Guide | HackerNoon
Computing
Why I’ll Take a Plastic Keyboard Over a Metal One Any Day
News
This Overlooked Hailee Steinfeld Apple TV+ Series Needs To Be On Your Watchlist – BGR
News

You Might also Like

News

Why I’ll Take a Plastic Keyboard Over a Metal One Any Day

9 Min Read
News

This Overlooked Hailee Steinfeld Apple TV+ Series Needs To Be On Your Watchlist – BGR

4 Min Read
News

Next week’s Google event is sounding more like a late-night talk show lineup

2 Min Read
News

I walked streets of Washington & saw scenes straight from disaster movie

6 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?