Monday, February 16, 2026
  • Hype
  • Murai
  • Lipstiq
  • Wanista
  • Varnam
  • Hangat
  • Autofreaks
Lowyat.NET
  • News
    • Lifestyle
    • Computing
    • Hardware
    • Internet
    • Rumours & Leaks
    • Software
  • Forums
    • Kopitiam
    • Tradezone
    • Property Talk
    • Finance & Business
    • Fast and Furious
  • Gaming
    • PC Gaming
    • Console
    • Esports
  • Mobile
    • Apps
    • OS
    • Tablets
    • Phones
    • Telco
      • Celcom
      • DiGi
      • Maxis
      • Tune Talk
      • U Mobile
      • Buzzme
  • Pricelists
    • Compu-zoneUpdated
    • ViewnetUpdated
    • Sri ComputersUpdated
    • StartecUpdated
  • More
    • Automotive Tech
    • Drone
    • Enterprise
    • Entertainment
    • Fashion
    • E-Hailing
    • Wearables
No Result
View All Result
Lowyat.NET
  • News
    • Lifestyle
    • Computing
    • Hardware
    • Internet
    • Rumours & Leaks
    • Software
  • Forums
    • Kopitiam
    • Tradezone
    • Property Talk
    • Finance & Business
    • Fast and Furious
  • Gaming
    • PC Gaming
    • Console
    • Esports
  • Mobile
    • Apps
    • OS
    • Tablets
    • Phones
    • Telco
      • Celcom
      • DiGi
      • Maxis
      • Tune Talk
      • U Mobile
      • Buzzme
  • Pricelists
    • Compu-zoneUpdated
    • ViewnetUpdated
    • Sri ComputersUpdated
    • StartecUpdated
  • More
    • Automotive Tech
    • Drone
    • Enterprise
    • Entertainment
    • Fashion
    • E-Hailing
    • Wearables
No Result
View All Result
Lowyat.NET
No Result
View All Result
Home Artifical Intelligence

Wikipedia Offers Dataset For AI Training To Deter Bot Scraping

And to ease the strain on its online encyclopedia's servers.

by Nurul Kamil
April 18, 2025
Wikipedia
Share on FacebookShare on Twitter

Wikipedia is releasing a dataset for training AI models as a means to dissuade bot scraping on its online encyclopedia. On Wednesday, The Wikimedia Foundation announced that it has partnered with Google-owned platform Kaggle to publish a beta dataset tailored for machine learning applications.

According to the organisation, the dataset in question comprises “structured Wikipedia content in English and French”, and as of 15 April includes openly licensed research summaries, short descriptions, image links, infobox data, and article sections. The dataset is intended to enable developers to gain access to machine-readable article data for modeling, fine-tuning, benchmarking, alignment, and analysis with ease.

AI stock
Image: Pexels

Previously, Wikimedia reported that AI bot crawlers were bombarding Wikipedia since the beginning of last year, causing strain on the online encyclopedia’s servers. The release of a dataset that’s specifically optimised for AI development would be a solution to this issue, as it should be a better alternative to scraping raw text. Additionally, the partnership with Kaggle would allow the data to become more accessible to independent entities and smaller companies.

For the uninitiated, Kaggle is a platform for machine learning practitioners, researchers, and data enthusiasts. Its partnerships lead, Brenda Flynn, stated that the company is eager to play a role in keeping the data hosted by the Wikimedia Foundation accessible.

(Source: The Verge)

Filed Under AIArtificial Intelligencewikimedia foundationWikipedia
Updated 4:57 pm, Fri, 18 April 25
https://lowy.at/2xhds
Share2Tweet1SendShare

Follow us on Instagram, Facebook, Twitter or Telegram for more updates and breaking news. 

No Result
View All Result

TRENDING THIS WEEK

  1. 1
    Editorial

    The Malaysian Who Sold AI.com For US$70 million, Did Not Purchase it for US$100 in 1993

  2. 2
    Streaming

    Astro To Drop HBO Channels From March, Introduces Four New Channels

  3. 3
    News

    Toyota Announces 2026 Corolla Cross HEV GR Sport; Priced In Malaysia From RM148,800

  4. 4
    Editorial

    The Kodak Charmera Is A Reminder To Just Enjoy The Little Things

  5. 5
    Streaming

    Astro One Sports And Epic Packs Offered At Discounted Prices Until 30 April 2026

NETWORK

  • Hype
  • Murai
  • Lipstiq
  • Wanista
  • Varnam
  • Hangat
  • Autofreaks

ABOUT

  • Advertise
  • Careers
  • Privacy Statement
  • Contact Us
  • Editorial Policy
  • Terms & Conditions

©2025 VIJANDREN RAMADASS. ALL RIGHTS RESERVED.

No Result
View All Result
  • News
    • Lifestyle
    • Computing
    • Hardware
    • Internet
    • Rumours & Leaks
    • Software
  • Forums
    • Kopitiam
    • Tradezone
    • Property Talk
    • Finance & Business
    • Fast and Furious
  • Gaming
    • PC Gaming
    • Console
    • Esports
  • Mobile
    • Apps
    • OS
    • Tablets
    • Phones
    • Telco
      • Celcom
      • DiGi
      • Maxis
      • Tune Talk
      • U Mobile
      • Buzzme
  • Pricelists
    • Compu-zone
    • Viewnet
    • Sri Computers
    • Startec
  • More
    • Automotive Tech
    • Drone
    • Enterprise
    • Entertainment
    • Fashion
    • E-Hailing
    • Wearables

©2026 VIJANDREN RAMADASS. ALL RIGHTS RESERVED.

No Result
View All Result
  • News
    • Lifestyle
    • Computing
    • Hardware
    • Internet
    • Rumours & Leaks
    • Software
  • Forums
    • Kopitiam
    • Tradezone
    • Property Talk
    • Finance & Business
    • Fast and Furious
  • Gaming
    • PC Gaming
    • Console
    • Esports
  • Mobile
    • Apps
    • OS
    • Tablets
    • Phones
    • Telco
      • Celcom
      • DiGi
      • Maxis
      • Tune Talk
      • U Mobile
      • Buzzme
  • Pricelists
    • Compu-zone
    • Viewnet
    • Sri Computers
    • Startec
  • More
    • Automotive Tech
    • Drone
    • Enterprise
    • Entertainment
    • Fashion
    • E-Hailing
    • Wearables

©2026 VIJANDREN RAMADASS. ALL RIGHTS RESERVED.