By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: Reddit says its blocking the Internet Archive to stop sneaky AI scrapers accessing its content – News
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > News > Reddit says its blocking the Internet Archive to stop sneaky AI scrapers accessing its content – News
News

Reddit says its blocking the Internet Archive to stop sneaky AI scrapers accessing its content – News

News Room
Last updated: 2025/08/11 at 10:00 PM
News Room Published 11 August 2025
Share
SHARE

Reddit Inc. said today it has decided to block the Internet Archive from indexing its popular web forums in order to prevent sneaky artificial intelligence firms from scraping its content for training purposes.

Reddit reportedly found evidence that AI companies were scraping its content via the Internet Archive’s platform, after it restricted them from doing so using its official website. The decision means that the organization’s popular Wayback Machine service will no longer be able to archive Reddit pages, threads, profiles or comments – nothing, except for what’s shown on its homepage.

A report in The Verge means that, going forward, the archive will only be able to show what posts and news headlines were popular on any given day. Previously, Wayback Machine was able to archive every single page, documenting everything that was posted onto the “front page of the internet,” as Reddit proclaims itself to be.

Reddit did not say which AI companies were using the Wayback Machine to get around its prohibitions on them scraping its content. A spokesperson for the company told The Verge that it has “become aware of instances where AI companies violate platform policies… and scrape data from the Wayback Machine.”

The company seems to think that the Internet Archive should be taking steps to prevent this scraping, so there’s hope that the decision won’t be a permanent one. However, the report also highlights a concern by Reddit that Wayback Machine has a tendency to archive user’s posts and comments that are later deleted, saying that this is problematic for user privacy.

“Until they’re able to defend their site and comply with platform policies, we’re limiting some of their access to Reddit data to protect redditors,” the company said.

Although Reddit raises the issue of user privacy, it’s likely that its primary motivation for blocking the scrapers is money. AI companies are expressly prohibited from crawling its website, unless they’re willing to pay to access that data. Several companies have taken Reddit up on that offer, notably Google LLC and OpenAI.

Reddit has never revealed how much its deal with OpenAI is worth, but the agreement with Google is reportedly worth around $60 million. Reddit has also stated previously that it hopes to generate as much as $200 million from such licensing agreements over the next three years.

One company that doesn’t seem prepared to pay up is Anthropic PBC. In June, Reddit filed a lawsuit against it, saying it was continuing to scrape its content even after it claimed it was no longer doing so.

The Internet Archive isn’t the first organization to be blocked by Reddit over scraping concerns. In June 2024, the social media firm said it had blocked Microsoft Corp.’s Bing and smaller search engines, such as DuckDuckGo, Mojeek and Qwant, in order to prevent its content being scraped through their archives.

It’s not immediately clear if the Internet Archive will try and take steps to prevent its archives from being scraped so it can get Reddit’s restrictions lifted. In a statement, Wayback Machine Director Mark Graham said his team is engaged in “ongoing discussions about this matter.”

Image: News/Microsoft Designer

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.

About News Media

News Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of News, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — News Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, News Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article Why Your A/B Testing Strategy is Broken (and How to Fix It) | HackerNoon
Next Article Hols hotspot retreat busted for giving tourist TOAD POISON for ‘astral journeys’
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

10 Best Free AI Meeting Note Taker Tools for Meetings in 2025
Computing
The geopolitics of semiconductors, Robin Saxby, former CEO of Arm – UKTN
News
Creditcoin’s Bold Plan to Make the World’s Invisible $2 Trillion Economy Visible | HackerNoon
Computing
Starlink Cuts Monthly Fee in Latest Bid to Attract New US Customers
News

You Might also Like

News

The geopolitics of semiconductors, Robin Saxby, former CEO of Arm – UKTN

2 Min Read
News

Starlink Cuts Monthly Fee in Latest Bid to Attract New US Customers

4 Min Read
News

TDK backs Ultraviolette with $21M to take India-made electric motorcycles global | News

8 Min Read
News

The best cheap VPN in August 2025

8 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?