Mom’s website ready to put OpenAI in a time-out after learning the AI firm may have scrapped its data

British parenting hub Mumsnet has filed a lawsuit against OpenAI, claiming it violated copyright law by using its data to train its AI models, including those powering ChatGPT. It’s the first such legal action taken against OpenAI in the United Kingdom, but one of a growing number of similar cases spread internationally accusing OpenAI of illicitly scraping information for its models without permission. Mumsnet claims its forums host more than six billion words and that OpenAI employed those words to teach its AI models about parenting and related topics.

“Such scraping without permission is an explicit breach of our terms of use, which clearly state that no part of the site may be distributed, scraped or copied for any purpose without our express approval,” Mumsnet co-founder Justine Roberts explained in a post on the website. “The LLMs are building models like ChatGPT to provide the answers to any and all prospective questions that will mean we’ll no longer need to go elsewhere for solutions. And they’re building those models with scraped content from the websites they are poised to replace.”

The legal complaint points to the timing of the data collection as another point of contention since it mainly happened before websites were paying close attention to whether AI companies were scraping their data. Mumsnet alleges that third-party research institutions initially performed the majority of this data scraping.

Roberts wrote that Mumsnet reached out to OpenAI about licensing its content, pointing out that the platform has a concentrated collection of writing by women that is unlike the majority of internet content. But, OpenAI turned them down, citing interest in “datasets that are not easily accessible online,” according to Roberts.

Scrape Scraps

Mumsnet is hardly alone in voicing complaints about OpenAI’s data scraping and is now part of an expanding cohort of companies taking OpenAI to court on the matter. For instance, the Authors Guild has sued OpenAI, alleging copyrighted books were used for training AI’s models, as have a group of academics claiming their articles were similarly lifted by OpenAI. Reuters and The New York Times have both sued OpenAI over not only data scraping but also by claiming ChatGPT generates responses with content far too close to their copyrighted articles. Even Creative Commons has filed suit against the AI developer, claiming that the company used Creative Commons-licensed content to train its AI models in ways that violated the terms of the licenses.

OpenAI has defended its practices as falling under the fair use doctrine. In the UK, the company responded to a House of Lords inquiry by acknowledging the necessity of using copyrighted materials for training its AI models and that it should do more to support content creators, but still maintains that what it does is legal. While this is OpenAI’s first UK case on the matter, Getty Images has a similar case going in the country’s courts against Stability AI for its image-generating AI.

The outcome of Mumsnet’s lawsuit and other cases may set precedents for how AI companies handle copyrighted content and might influence future regulations and licensing practices. The effort to balance AI innovation and intellectual property rights is far from settled and probably won’t be for a long while.

To be fair, Mumsnet isn’t against LLMs and AI as a concept. In fact, Mumsnet employed OpenAI’s models to build an AI chatbot called MumsGPT last year. MumsGPT was only available to executives at Mumsnet when it was announced and hasn’t been mentioned since, so it may not be around anymore, but the idea was to offer it as a research tool and even as something policymakers could use in developing parenting-related regulations. Roberts didn’t mention MumsGPT but made a point of saying that there are positive potential uses for AI in her explanation of the lawsuit.

“But if the LLMs are allowed to simply steal content from publishers and communities like Mumsnet they risk destroying them,” Roberts wrote. “We know that taking on a multinational giant like OpenAI, with its $3bn of revenues, is not an easy task in the face of the huge resources they’ll throw at us but this is too important an issue to simply roll over. Not just for Mumsnet but for every website you’ve ever landed on for news, advice or simply to ask if you’re being unreasonable.”

You might also like…

https://cdn.mos.cms.futurecdn.net/cfQV43tBr6CJdCYjMmEZPB-1200-80.jpg

Source link
erichs211@gmail.com (Eric Hal Schwartz)

Dreo Smart Misting Fan 516S review: effective, mess-free, and moderately priced

The 4 turntables they really want on Graduation Day — or for their new apartment — as picked by an audio editor

Sennheiser Momentum 5 review: fab features, epic battery — but default sound you’ll have to polish

Sonos Era 100 SL review: cheaper without any acoustic compromises

Ex-Israeli Intelligence Official: Shockwaves of Trump’s “Take Over Gaza” Heard, Felt Across Region

What UK political parties are promising in the 2019 general election

Otto Warmbier’s parents want North Korea to suffer for their son’s death

Could a ‘youthquake’ cause Boris Johnson to lose the general election?

Lisa Lu Reunites With Fans at Shanghai Screening of ‘The Arch’

10 Greatest Dark Fantasy Anime of the 2020s

Jason Momoa’s Forgotten 3-Part Apple TV Sci-Fi Series Is Still Worth a Weekend Binge

14 Major Manga Series Ended Or Expected To End In 2026

Israel says it ’eliminated’ two Hamas and Islamic Jihad operatives tied to major funding network

Will Nifty hit 25,000 this month? Key levels to watch in the week ahead

Will Sensex, Nifty bounce back on Monday? Iran peace deal risks among 5 factors to drive D-St this week

Dividends & bonus issues: LIC, Asian Paints among 35 stocks turning ex-record date this week. How many do you own?

The YouTuber who has become one of Gen Z’s most beloved celebrities

26 last-minute holiday gifts that are still thoughtful and unique

Practicing gratitude regularly can make you less stressed and sleep better

8 things millennials wish you would just stop getting them for the holidays

Mom’s website ready to put OpenAI in a time-out after learning the AI firm may have scrapped its data

Lisa Lu Reunites With Fans at Shanghai Screening of ‘The Arch’

10 Greatest Dark Fantasy Anime of the 2020s

Israel says it ’eliminated’ two Hamas and Islamic Jihad operatives tied to major funding network

Jason Momoa’s Forgotten 3-Part Apple TV Sci-Fi Series Is Still Worth a Weekend Binge