By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: Up to 25 percent distorted: Microsoft researchers warn against letting AI process large documents
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > Gadget > Up to 25 percent distorted: Microsoft researchers warn against letting AI process large documents
Gadget

Up to 25 percent distorted: Microsoft researchers warn against letting AI process large documents

News Room
Last updated: 2026/05/15 at 9:09 AM
News Room Published 15 May 2026
Share
Up to 25 percent distorted: Microsoft researchers warn against letting AI process large documents
SHARE

AI providers such as OpenAI, Anthropic and Google promise that their models can capture and process large documents in a very short time. The aim is not only to make employees more productive – but also to partially replace them. However, a recent Microsoft study comes to a sobering conclusion: even the most powerful models make more and more errors with complex documents over time.

Share of AI-related layoffs is increasing

According to the consulting firm Challenger, Gray & Christmas, a quarter of all terminations can now be attributed to AI. In an earlier evaluation based on data from November 2025, the proportion was still less than one percent. More and more companies are openly admitting that they are cutting jobs through AI. Cloudflare also recently announced that it would lay off 20 percent of its workforce. “The move is not a cost-cutting measure or an assessment of individual performance. It is about Cloudflare defining how a world-class, high-growth company operates and creates value in the age of agent-based AI,” it said in a post about the layoffs.

Editorial recommendations

However, the Microsoft researchers found that large language models increasingly distort documents over long workflows – in the worst case, data is lost and the models hallucinate. In order to simulate long workflows in 52 specialist areas, the team developed the Delegate-25 tool. They used it to test 19 language models, including Gemini 3.1 Pro from Google, Claude Opus 4.6 from Anthropic and GPT-5.4 from OpenAI. The result: On average, 25 percent of the content of the top models mentioned was adulterated. For other models it was even more than half.

How reliable are AI tools really?

“Delegation requires trust,” say the three Microsoft researchers Philippe Laban, Tobias Schnabel and Jennifer Neville. “Our analysis shows that current language models are unreliable delegates. They cause rare but serious errors that corrupt documents unnoticed and accumulate over long interaction times.” The error rate depended on the specialist area: the models performed better when programming than in other applications. The researchers defined an accuracy of 98 percent after 20 interactions as the minimum standard for use in a specific area. Most models only achieved this value in a single area – namely Python programming. Gemini 3.1 Pro achieved the best performance, meeting the standard in eleven out of 52 areas.

Recommended editorial content

Here you can find external content from Podigee GmbHwhich complement our editorial offering on . By clicking “Show content” you agree that we can show you content from. now and in the future Podigee GmbH may display on our pages. Personal data may be transmitted to third-party platforms.

Note on data protection

Unfortunately something went wrong…

At this point you will usually find external content from Podigee GmbHbut we were unable to retrieve your consent settings.
Reload the page or adjust your consent settings manually.

This is still a preliminary study version that still needs to be assessed. Nevertheless, the researchers find clear words. “Large language models are not yet ready for delegated workflows in the vast majority of areas. In 80 percent of our simulated conditions, the models severely distorted documents,” said the research team. What is striking is that the errors were not caused by constant small inaccuracies, but by sudden, massive data losses. “More powerful models do not avoid small errors better, but rather delay critical failures and experience them in fewer interactions,” the study says. However, progress can be seen: When comparing GPT-4o and GPT-5.4, the accuracy increased from 14.7 to 71.5 percent.

Recommended editorial content

Here you can find external content from TargetVideo GmbHwhich complement our editorial offering on . By clicking “Show content” you agree that we can show you content from. now and in the future TargetVideo GmbH may display on our pages. Personal data may be transmitted to third-party platforms.

Note on data protection

Unfortunately something went wrong…

At this point you will usually find external content from TargetVideo GmbHbut we were unable to retrieve your consent settings.
Reload the page or adjust your consent settings manually.

Researchers complain about unreliability

An Asana study also shows that users are skeptical about the technology: Although 77 percent of employees already use AI agents, almost two thirds consider the systems to be unreliable. “Crucially, users who delegate work may lack the expertise or time to review changes implemented by the model and need to trust that it will not cause undetected errors such as hallucinations or deletion,” the researchers said. Nevertheless, it is still necessary to monitor AI systems closely.

Top Article

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Comparison test: Four 700 euro notebooks against the MacBook Neo Comparison test: Four 700 euro notebooks against the MacBook Neo
Next Article the wheels fall off the wheels fall off
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

Roland-Garros will serve as a crash test for France against IPTV piracy
Roland-Garros will serve as a crash test for France against IPTV piracy
Mobile
the wheels fall off
the wheels fall off
Gaming
Comparison test: Four 700 euro notebooks against the MacBook Neo
Comparison test: Four 700 euro notebooks against the MacBook Neo
Software
Amazon developers cheat on AI use | Computer Week
Amazon developers cheat on AI use | Computer Week
News

You Might also Like

Weekly podcast about Claude Mythos Preview and cybersecurity
Gadget

Weekly podcast about Claude Mythos Preview and cybersecurity

0 Min Read
New calendar functions and Claude: What Microsoft is planning for Outlook in May
Gadget

New calendar functions and Claude: What Microsoft is planning for Outlook in May

0 Min Read
Fatal overdose after ChatGPT council: Parents of deceased teenager sue OpenAI
Gadget

Fatal overdose after ChatGPT council: Parents of deceased teenager sue OpenAI

4 Min Read
This is how AI and robots are changing artificial insemination
Gadget

This is how AI and robots are changing artificial insemination

0 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?