(Reuters) – Social media platform Reddit said on Tuesday it would update the web standard it uses to block automated data collection from its website, following reports that artificial intelligence startups were circumventing the rule to scrape content for their systems.
The move comes as artificial intelligence companies face accusations of plagiarizing publishers' content to create AI-generated summaries without attribution or permission.
Reddit said it would update its Robots Exclusion Protocol file, or robots.txt, a widely used standard that tells automated crawlers which parts of a site they are allowed to visit.
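In rough terms, a crawler that honors the standard fetches a site's robots.txt before crawling and checks each URL against its rules. The sketch below, using Python's standard urllib.robotparser module, illustrates that check; the user agent "ExampleBot" is hypothetical, and Reddit's live file may differ from what this example assumes.

    # Illustrative only: checks whether a hypothetical crawler may fetch a page,
    # according to the rules published in a site's robots.txt file.
    import urllib.robotparser

    rp = urllib.robotparser.RobotFileParser()
    rp.set_url("https://www.reddit.com/robots.txt")
    rp.read()  # download and parse the robots.txt file

    # "ExampleBot" is a made-up user agent used for illustration.
    allowed = rp.can_fetch("ExampleBot", "https://www.reddit.com/r/technology/")
    print(allowed)  # False if the site's rules disallow this crawler for that path

A crawler that ignores this file can still request the pages; the standard relies on voluntary compliance, which is why publishers pair it with other controls.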
The company also said it will maintain rate limiting, a technique used to cap the number of requests from a single entity, and will block unknown bots and crawlers from scraping (collecting and storing raw information) from its website.
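Rate limiting can be implemented in several ways; one common approach, shown as a rough sketch below and not a description of Reddit's actual mechanism, is a token bucket that gives each client a budget of requests that refills over time.

    # Minimal token-bucket rate limiter: each client gets a budget of requests
    # that refills at a fixed rate; requests beyond the budget are rejected.
    # Illustrative sketch only; the capacity and refill rate are arbitrary.
    import time

    class TokenBucket:
        def __init__(self, capacity: int, refill_per_sec: float):
            self.capacity = capacity            # maximum burst of requests
            self.tokens = float(capacity)       # current remaining budget
            self.refill_per_sec = refill_per_sec
            self.last = time.monotonic()

        def allow(self) -> bool:
            now = time.monotonic()
            # Refill the budget based on elapsed time, capped at capacity.
            self.tokens = min(self.capacity,
                              self.tokens + (now - self.last) * self.refill_per_sec)
            self.last = now
            if self.tokens >= 1:
                self.tokens -= 1
                return True
            return False

    # One bucket per client: a burst of up to 60 requests, refilled at 1 per second.
    bucket = TokenBucket(capacity=60, refill_per_sec=1.0)
    print(bucket.allow())  # True while the client stays within its budget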
More recently, the robots.txt file has become a key tool that publishers use to stop tech companies from freely using their content to train AI algorithms and create summaries in response to some search queries.
Last week, content licensing startup TollBit sent a letter to publishers warning that several artificial intelligence companies were circumventing the web standard to scrape publisher sites.
This follows a Wired investigation that found AI search startup Perplexity may have bypassed attempts to block its web crawler via robots.txt.
Earlier in June, business media publisher Forbes accused Perplexity of plagiarizing its investigative stories for use in generative artificial intelligence systems without giving due credit.
Reddit said Tuesday that researchers and organizations such as the Internet Archive will continue to have access to its content for non-commercial use.