This AI Is Outranking Humans As A Top Software Bug Hunter

An AI program has climbed the leaderboard for discovering real-world software vulnerabilities, besting the human reviewers to take the top spot in the US.

The program, called Xbow, currently ranks number one on the US-based leaderboard at HackerOne, a platform that coordinates software vulnerability discoveries with major companies.

Over the last few months, Xbow has increased its reputation score at HackerOne by reporting over 1,000 apparent software flaws; 132 have been reported as officially discovered and resolved. Impacted companies include The Walt Disney Company, AT&T, Ford and Epic Games.

(Credit: HackerOne)

(Credit: Xbow)

In total, the AI program has submitted nearly 1,060 vulnerabilities, the startup behind Xbow announced on Tuesday. “All findings were fully automated,” the team added, “though our security team reviewed them pre-submission to comply with HackerOne’s policy on automated tools.”

While 132 of the flaws were officially resolved, another 303 were classified as “triaged,” meaning the reported bug has been acknowledged, but not resolved. Another 125 are pending review.

This Tweet is currently unavailable. It might be loading or has been removed.

So, it’s possible Xbow may have discovered an even larger crop of vulnerabilities that still need to be confirmed. But the AI program didn’t always find a new security issue; 208 of the submitted reports were marked as “duplicates” while another 209 were flagged as merely “informative.” The remaining 36 were declared not applicable.

Get Our Best Stories!

Stay Safe With the Latest Security News and Updates

Sign up for our SecurityWatch newsletter for our most important privacy and security stories delivered right to your inbox.

By clicking Sign Me Up, you confirm you are 16+ and agree to our Terms of Use and Privacy Policy.

Thanks for signing up!

Your subscription has been confirmed. Keep an eye on your inbox!

Still, the results show how new AI programs could shake up the cybersecurity industry through automated vulnerability discovery at a scale that outpaces human security researchers. “Notably, around 45% of Xbow’s findings are still awaiting resolution, highlighting the volume and impact of the submissions across live targets,” the Xbow team adds.

(Credit: Xbow)

In addition, technology promises to help companies stay ahead of malicious hackers who have also been trying to adopt generative AI. Xbow is designed to be fully autonomous, with the ability to complete “comprehensive penetration tests in just a few hours,” the company says.

But Xbow is also raising some concerns about whether it’s generating quantity over quality in terms of vulnerability reports. “Receiving hundreds of AI-generated bug reports would be so demoralizing and probably turn me off from maintaining an open source project forever,” wrote one user on the Hacker News forum. “I think developers are going to eventually need tools to filter out slop.”

Recommended by Our Editors

Brendan Dolan-Gavitt, an Xbow AI researcher, responded to the skepticism and criticism, writing: “The main difference is that all of the vulnerabilities reported here are real, many quite critical.” Others also point out the submissions from human security researchers on HackerOne can also be of low-quality.

HackerOne also chimed in on Xbow’s development: “AI is a force multiplier for crowdsourced security,” said Michiel Prins, HackerOne’s cofounder, in a statement. “Hackbot companies like Xbow are bringing impressive innovation to the space, accelerating how we discover and respond to vulnerabilities. But AI doesn’t learn to hack on its own—hackers train it. Human researchers remain essential partners in this feedback loop, and while AI leads in volume, we still see humans delivering the findings with the greatest business impact. Hackbots are simply the next step in an evolution driven by human ingenuity harnessing automation.”

Xbow released the results as it’s trying to sell its technology to customers. Bloomberg reports that Xbox recently raised $75 million through a new funding round.

About Michael Kan

Senior Reporter

I’ve been working as a journalist for over 15 years—I got my start as a schools and cities reporter in Kansas City and joined PCMag in 2017.

Read Michael’s full bio

This AI Is Outranking Humans as a Top Software Bug Hunter

Stay Safe With the Latest Security News and Updates

Recommended by Our Editors

About Michael Kan

Senior Reporter

Read the latest from Michael Kan

Leave a Reply

Stay Safe With the Latest Security News and Updates

Recommended by Our Editors

About Michael Kan

Senior Reporter

Read the latest from Michael Kan

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.

Leave a Reply Cancel reply

Leave a Reply