Sunday, May 31, 2026
The BLOCKCHAIN Page

Reddit blocks the Internet Archive from crawling its data – here’s why

by admin
August 12, 2025
in NFTs & Metaverse


Andriy Onufriyenko/Getty Images

ZDNET’s key takeaways

  • The Internet Archive can now crawl only Reddit’s homepage.
  • Reddit’s goal is to block AI companies from scraping Reddit user data.
  • Publishers (and others) are suing AI firms for copyright infringement.

Reddit is defending its privacy from AI firms that are taking roundabout approaches to scraping its content.

The social media platform, known as a resource where users can post anonymously and find information about virtually any subject, will block the Internet Archive’s Wayback Machine from indexing its online data, according to a Monday report from The Verge. The move is in response to the discovery that AI companies, unable to scrape data from Reddit directly because of the platform’s prohibitive policies, have instead been retrieving its data from indexed content on the Internet Archive and using it to train models.

The Wayback Machine will now be able to scrape data only from Reddit’s homepage, according to The Verge, while access to user profiles, comments, and post detail pages will be blocked.

Launched in 1996, the Internet Archive is a non-profit that operates a vast digital database of web content. The archive is maintained in part by the Wayback Machine, a piece of web-crawling software that gathers web pages and preserves them as they appeared when they were collected, like digital flies in amber. This serves as a resource for researchers studying the evolution of online culture and as digital forensic evidence for law enforcement, among other uses.

What Reddit’s move means

Reddit has previously flagged concerns related to the scraping of its content with the Internet Archive, according to The Verge. The non-profit was also reportedly notified before the web-crawling restrictions started going into effect yesterday.

The Internet Archive has yet to make an official statement about how it plans to respond to Reddit’s new restrictions, and at the time of writing, it has not responded to ZDNET’s request for comment. Wayback Machine director Mark Graham, however, has told several publications that the Internet Archive will “continue to have ongoing discussions about this matter” with Reddit.

Rising tension

Reddit’s reported decision to block the Wayback Machine from scraping the majority of its content arrives during a moment of mounting tension between AI companies and digital publishers, though Reddit is the first tech company to wade into the debate. The company sued Anthropic in June after discovering that the AI company was illegally scraping its data, but it has also previously signed licensing deals with both Google and OpenAI.

(Disclosure: Ziff Davis, ZDNET’s parent company, filed an April 2025 lawsuit against OpenAI, alleging it infringed Ziff Davis copyrights in training and operating its AI systems.)

AI developers require access to gargantuan troves of data to train generative AI models, which are designed to identify and replicate subtle mathematical patterns gleaned from those training datasets.

Many of those companies have scraped training data from publicly available websites, including social media sites and news outlets, claiming legal immunity under a concept known in copyright law as fair use. (The courts are still untangling the legitimacy of that argument, and will likely be doing so for some time.)

Many of the organizations whose content has been copiously scraped, along with a cohort of authors and other artists, have responded with lawsuits.

Others, meanwhile, have signed content licensing agreements with the likes of OpenAI, Anthropic, and Google, consenting to the use of their organizations’ data in exchange for increased visibility in the responses generated by chatbots, or other benefits.





© 2023 TheBlockchainPage | All Rights Reserved
