{"id":211685,"date":"2025-12-18T00:44:55","date_gmt":"2025-12-18T00:44:55","guid":{"rendered":"https:\/\/entertainment.runfyers.com\/index.php\/2025\/12\/18\/adobe-hit-with-proposed-class-action-accused-of-misusing-authors-work-in-ai-training-techcrunch\/"},"modified":"2025-12-18T00:44:55","modified_gmt":"2025-12-18T00:44:55","slug":"adobe-hit-with-proposed-class-action-accused-of-misusing-authors-work-in-ai-training-techcrunch","status":"publish","type":"post","link":"https:\/\/entertainment.runfyers.com\/index.php\/2025\/12\/18\/adobe-hit-with-proposed-class-action-accused-of-misusing-authors-work-in-ai-training-techcrunch\/","title":{"rendered":"Adobe hit with proposed class-action, accused of misusing authors&#8217; work in AI training | TechCrunch"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<p id=\"speakable-summary\" class=\"wp-block-paragraph\">Like pretty much every other tech company in existence, Adobe has leaned heavily into AI over the past several years. The software firm has launched a number of different AI services since 2023, including<a href=\"https:\/\/techcrunch.com\/2025\/12\/16\/adobe-firefly-now-supports-prompt-based-video-editing-adds-more-third-party-models\/\" target=\"_blank\" rel=\"noopener\"> Firefly<\/a>\u2014its AI-powered media-generation suite. Now, however, the company\u2019s full-throated embrace of the technology may have led to trouble, as a new lawsuit claims it used pirated books to train one of its AI models.<\/p>\n<p class=\"wp-block-paragraph\">A proposed class-action lawsuit filed on behalf of Elizabeth Lyon, an author from Oregon, claims that Adobe used pirated versions of numerous books \u2014 including her own \u2014 to train the company\u2019s <a href=\"https:\/\/arxiv.org\/html\/2411.09944v1\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">SlimLM program<\/a>.<\/p>\n<p class=\"wp-block-paragraph\">Adobe describes SlimLM as a small language model series that can be \u201coptimized for document assistance tasks on mobile devices.\u201d It<a href=\"https:\/\/research.adobe.com\/publication\/slimlm-an-efficient-small-language-model-for-on-device-document-assistance\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\"> states that<\/a> SlimLM was pre-trained on SlimPajama-627B, a \u201cdeduplicated, multi-corpora, open-source dataset\u201d<a href=\"https:\/\/www.cerebras.ai\/blog\/slimpajama-a-627b-token-cleaned-and-deduplicated-version-of-redpajama\" target=\"_blank\" rel=\"noreferrer noopener nofollow\"> released by Cerebras<\/a> in June of 2023. Lyon, who has written a number of guidebooks for non-fiction writing, says that some of her works were included in a pretraining dataset that Adobe had used.<\/p>\n<p class=\"wp-block-paragraph\">Lyon\u2019s lawsuit, which was<a href=\"https:\/\/www.reuters.com\/legal\/government\/adobe-sued-allegedly-misusing-authors-work-ai-training-2025-12-17\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\"> originally reported<\/a> on by Reuters, says that her writing was included in a processed subset of a manipulated dataset that was the basis of Adobe\u2019s program: \u201cThe SlimPajama dataset was created by copying and manipulating the RedPajama dataset (including copying Books3),\u201d the lawsuit says. \u201cThus, because it is a derivative copy of the RedPajama dataset, SlimPajama contains the Books3 dataset, including the copyrighted works of Plaintiff and the Class members.\u201d<\/p>\n<p class=\"wp-block-paragraph\">\u201cBooks3\u2033\u2014a huge <a href=\"https:\/\/www.theatlantic.com\/technology\/archive\/2023\/09\/books3-database-generative-ai-training-copyright-infringement\/675363\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">collection of 191,000 books<\/a> that have been used to train genAI systems\u2014has been an ongoing source of legal trouble for the tech community. RedPajama has also been cited in a number of litigation cases. In September, <a href=\"https:\/\/www.macobserver.com\/news\/apple-faces-lawsuit-over-use-of-pirated-books-to-train-ai-models\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">a lawsuit <\/a>against Apple claimed the company had used copyrighted material to <a href=\"https:\/\/www.reuters.com\/sustainability\/boards-policy-regulation\/apple-sued-over-use-copyrighted-books-train-apple-intelligence-2025-10-10\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">train its Apple Intelligence model<\/a>. The litigation mentioned the dataset and accused the tech company of copying protected works \u201cwithout consent and without credit or compensation.\u201d In October, a similar lawsuit against Salesforce<a href=\"https:\/\/www.jdsupra.com\/legalnews\/salesforce-used-pirated-books-to-train-9970854\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\"> also<\/a> claimed the company had used RedPajama for training purposes.\u00a0<\/p>\n<p class=\"wp-block-paragraph\">Unfortunately for the tech industry, such lawsuits have, by now, become somewhat commonplace. AI algorithms are trained on massive datasets and, in some cases, those datasets have allegedly including pirated materials. In September, Anthropic<a href=\"https:\/\/www.npr.org\/2025\/09\/05\/g-s1-87367\/anthropic-authors-settlement-pirated-chatbot-training-material\" target=\"_blank\" rel=\"noreferrer noopener nofollow\"> agreed to pay $1.5 billion<\/a> to a number of authors who had sued it and accused it of using pirated versions of their work to train its chatbot, Claude. The case was considered a potential turning point in the ongoing legal battles over copyrighted material in AI training data, of which there are many.<\/p>\n<\/div>\n<p><br \/>\n<br \/><a href=\"https:\/\/techcrunch.com\/2025\/12\/17\/adobe-hit-with-proposed-class-action-accused-of-misusing-authors-work-in-ai-training\/\" target=\"_blank\" rel=\"noopener\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Like pretty much every other tech company in existence, Adobe has leaned heavily into AI over the past several years. The software firm has launched a number of different AI services since 2023, including Firefly\u2014its AI-powered media-generation suite. Now, however, the company\u2019s full-throated embrace of the technology may have led to trouble, as a new [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":211686,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[14],"tags":[],"class_list":{"0":"post-211685","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-tech"},"_links":{"self":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/211685","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/comments?post=211685"}],"version-history":[{"count":0,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/211685\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media\/211686"}],"wp:attachment":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media?parent=211685"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/categories?post=211685"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/tags?post=211685"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}