Xorte logo

News Markets Groups

USA | Europe | Asia | World| Stocks | Commodities



Add a new RSS channel

 
 


Keywords

2025-04-17 16:32:55| Engadget

Wikipedia has been struggling with the impact that AI crawlers bots that are scraping text and multimedia from the encyclopedia to train generative artificial intelligence models have been having on its servers, leading to increased costs and slower load times for human users in some cases. Perhaps in an effort to stop the bots from pummeling the public Wikipedia website and soaking up too much bandwidth, the Wikimedia Foundation (which manages Wikipedia's data) is offering AI developers a dataset they can freely use. The organization has teamed up with Kaggle, a data science platform, to offer up a beta release of a structured dataset in both English and French. According to Google which owns Kaggle the dataset is formatted for machine learning to make it more useful for training, development and data science. Wikimedia Enterprise notes that the dataset includes "abstracts, short descriptions, infobox-style key-value data, image links and clearly segmented article sections." There are no references or other "non-prose elements," such as video clips. The lack of references could make the issue of attribution for information in the dataset somewhat foggy. However, Wikimedia Enterprise (a part of the Wikimedia Foundation that seeks to make Wikipedia data available through APIs) says that the content in the dataset is freely licensed under Creative Commons, the public domain and so on since it's all from Wikipedia.This article originally appeared on Engadget at https://www.engadget.com/ai/wikipedia-offers-ai-developers-a-training-dataset-to-maybe-get-scraper-bots-off-its-back-143255593.html?src=rss


Category: Marketing and Advertising

 

Latest from this category

01.05Apple ordered to pay $502 million to Optis by UK courts
01.05Apples iPad Air M3 is $100 off
01.05Apple sends spyware warnings to iPhone users in 100 countries
01.05ASUS adds, then removes, the ability to detect sagging in its latest ROG Astral GPUs
01.05Microsoft is raising prices on the Xbox Series S and Series X
01.05Sam Altman's eyeball-scanning ID technology debuts in the US
01.05Borderlands 4 will have individual difficulty settings for co-op players
01.05Rivian R1S Gen 2 review: The rugged foundation of Rivians electric empire
Marketing and Advertising »

All news

01.05Harrods latest retailer to be hit by cyber attack
01.05Xbox prices hiked worldwide amid tariff uncertainty
01.05Apple ordered to pay $502 million to Optis by UK courts
01.05Over 2 million Ninja-branded pressure cookers are recalled after reports of serious burn injuries
01.05May Day rally underway in Union Park with march planned to Grant Park later in the day
01.05What to know about May Day, including its Chicago origins and how it has grown over the years
01.05Patrona Corporation
01.05Apples iPad Air M3 is $100 off
More »
Privacy policy . Copyright . Contact form .