World of Software

Grok’s First Vibe-Coding Agent Has a High ‘Dishonesty Rate’

News Room
Last updated: 2025/08/31 at 10:47 PM
News Room Published 31 August 2025

Elon Musk’s xAI has released its first agentic coding model, which the company describes as “speedy and economical.” However, the model also has “a higher dishonesty rate” than the company’s flagship chatbot model, Grok 4.

The AI startup designed the new model, grok-code-fast-1, specifically for coding tasks. It’s free now for a limited time and accessible within GitHub Copilot, Cursor, Cline, Roo Code, Kilo Code, opencode, and Windsurf. “Grok-code-fast-1 has mastered the use of common tools like grep, terminal, and file editing, and thus should feel right at home in your favorite IDE,” xAI says.

But its propensity not to tell the truth could create problems for users. “We find that the dishonesty rate exceeds that of Grok 4,” says the model card. The company attributes this in part to its “safety training, which teaches the model to answer all queries that do not express [a] clear intent to engage in specified prohibited activities.”

Translation: if it doesn’t know the answer to your question, it might lie.

If programmers ask the model whether a certain part of the codebase is working and it doesn’t know, it may say “yes” when the opposite is true. It might also confirm that it completed a test the engineer asked it to do when it did not. This could create blind spots and duplicate work.

It’s not a major concern for xAI, which says it doesn’t expect the model “to be widely used as a general-purpose assistant,” like ChatGPT or the Grok chatbot.

Vibe-coding agents are a new trend that stands to revolutionize the field, but they’re far from perfect. One tool deleted a startup’s entire client database on its own and deceived the user multiple times along the way. In fact, most of the large language models on the market today have behavioral issues, including blackmail, sabotage, lying, and telling users what they want to hear (sycophancy). In a recent test, Anthropic and OpenAI examined each other’s models and found these issues in almost all of them.

Another eye-catching part of the Grok Code Fast 1 model card discusses the risk of someone using it to develop biological weapons. The company tested for this before release, along with issues related to cybersecurity and chemical knowledge. But bioweapons are the biggest risk, and “have the potential for the greatest scale of harm, [since] frontier models significantly lower the barrier to entry to the creation of bioweapons,” xAI says.

The results showed that Grok Code Fast 1 was worse than a human at “identifying issues in biological protocols,” but it was better at “troubleshooting wet lab virology experiments.” Again, xAI downplayed the issue, claiming that since the capabilities are similar to Grok 4, the new model “does not meaningfully change the risk landscape.”

Earlier this month, Anthropic updated the usage policy of its Claude chatbot to forbid using it to “synthesize, or otherwise develop, high-yield explosives or biological, chemical, radiological, or nuclear weapons or their precursors.”

Grok Code Fast 1 has secretly been out in the wild for the past week under the code name sonic. The xAI team says it “carefully monitored” feedback and deployed fixes, and plans to keep up a high rate of improvements “in days rather than weeks.” At the same time, lying seems to be a particularly tough problem for AI companies to completely solve, at least in the short term.

About Emily Forlini

Senior Reporter

I’m the expert at PCMag for all things electric vehicles and AI. I’ve written hundreds of articles on these topics, including product reviews, daily news, CEO interviews, and deeply reported features. I also cover other topics within the tech industry, keeping a pulse on what technologies are coming down the pipe that could shape how we live and work.
