By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: Grok Is the Worst Mainstream Chatbot at Sports Betting, Says Research
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > News > Grok Is the Worst Mainstream Chatbot at Sports Betting, Says Research
News

Grok Is the Worst Mainstream Chatbot at Sports Betting, Says Research

News Room
Last updated: 2026/04/12 at 12:55 PM
News Room Published 12 April 2026
Share
Grok Is the Worst Mainstream Chatbot at Sports Betting, Says Research
SHARE

X’s chatbot Grok may have proved its ability to give hot takes on Nazi Germany or put almost anything in a bikini, but there’s one area that new research has found it majorly underperforming compared to its rivals: predicting sports results.

According to a report by AI start-up General Reasoning, first shared with The Financial Times, Grok performed the worst out of eight widely used large language models when it came to predicting and betting on the results of the 2023–24 Premier League season, the world’s most popular soccer league.

Eight LLMs were fed detailed historical data and statistics about each team and previous games. The LLMs were then told to build models that would maximize returns and manage risk when placing bets. Each LLM was given three tries at running the simulation, and a $133,000 (£100,000) pot to place bets with.

Anthropic’s Claude Opus 4.6 did the best of any chatbot tested, losing 11.0% on average over its three tries and ending with an average pot of £89,035.

X’s Grok, in contrast, lost all its money on one attempt and failed to complete its tasks on the next two attempts, finishing with an average final pot of zero. OpenAI’s GPT-5.4 also turned in a respectable, though still losing, performance. GPT-5.4 lost 13.6% on average, ending with a final average pot of $116,000 (£86,365). However, its worst try, where it lost 31.6%, was worse than any of Claude’s. Google’s Gemini 3.1 Pro recorded worse overall performance but with high variability, losing 43.3% on average, but returning 33.7% on its best attempt.

Recommended by Our Editors

The authors of the paper found, in general, that AI was “systematically underperforming humans” in its testing. Meanwhile, Ross Taylor, General Reasoning’s chief executive, said that despite the hype around AI automation, there is currently “not a lot of measurement of putting AI into a long-term horizon setting,” highlighting how a lot of current testing occurs in “very static environments” that don’t reflect the complexity of real life.

The news comes as Grok may soon see more corporate adoption, with xAI’s owner, Elon Musk, reportedly forcing banks working on the upcoming SpaceX IPO to subscribe to the tool.

Newsletter Icon

Get Our Best Stories!

Your Daily Dose of Our Top Tech News


What's New Now Newsletter Image

Sign up for our What’s New Now newsletter to receive the latest news, best new products, and expert advice from the editors of PCMag.

Sign up for our What’s New Now newsletter to receive the latest news, best new products, and expert advice from the editors of PCMag.

By clicking Sign Me Up, you confirm you are 16+ and agree to our Terms of Use and Privacy
Policy.

Thanks for signing up!

Your subscription has been confirmed. Keep an eye on your inbox!

About Our Expert

Will McCurdy


Experience

I’m a reporter covering weekend news. Before joining PCMag in 2024, I picked up bylines in BBC News, The Guardian, The Times of London, The Daily Beast, Vice, Slate, Fast Company, The Evening Standard, The i, TechRadar, and Decrypt Media.

I’ve been a PC gamer since you had to install games from multiple CD-ROMs by hand. As a reporter, I’m passionate about the intersection of tech and human lives. I’ve covered everything from crypto scandals to the art world, as well as conspiracy theories, UK politics, and Russia and foreign affairs.

Read Full Bio

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Bitcoin Layer-2 Options: How BTC Holders Get Rewards in 2026 Bitcoin Layer-2 Options: How BTC Holders Get Rewards in 2026
Next Article China’s Premier Li Qiang to deliver keynote at World Artificial Intelligence Conference 2024 · TechNode China’s Premier Li Qiang to deliver keynote at World Artificial Intelligence Conference 2024 · TechNode
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

iPhone shipments surge 40% y-o-y in May in China, driven by price cuts · TechNode
iPhone shipments surge 40% y-o-y in May in China, driven by price cuts · TechNode
Computing
Dell Pro Max Tower T2 Review: This Workstation Scales From Sensible to Savage
Dell Pro Max Tower T2 Review: This Workstation Scales From Sensible to Savage
News
Rust + OpenGL: Rendering 250,000 Dynamic 3D Entities at 50 FPS on a Single CPU Thread | HackerNoon
Rust + OpenGL: Rendering 250,000 Dynamic 3D Entities at 50 FPS on a Single CPU Thread | HackerNoon
Computing
Sunday Reboot: MacBook Neo upgrades, masses of Mac minis, and iPhone re-entry
Sunday Reboot: MacBook Neo upgrades, masses of Mac minis, and iPhone re-entry
News

You Might also Like

Dell Pro Max Tower T2 Review: This Workstation Scales From Sensible to Savage
News

Dell Pro Max Tower T2 Review: This Workstation Scales From Sensible to Savage

3 Min Read
Sunday Reboot: MacBook Neo upgrades, masses of Mac minis, and iPhone re-entry
News

Sunday Reboot: MacBook Neo upgrades, masses of Mac minis, and iPhone re-entry

1 Min Read
11 Major Projector Brands Ranked, According To User Reviews – BGR
News

11 Major Projector Brands Ranked, According To User Reviews – BGR

18 Min Read
Apple Reportedly Testing Glasses AI in Several Frame Styles
News

Apple Reportedly Testing Glasses AI in Several Frame Styles

3 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?