{"id":31719,"date":"2023-08-14T10:26:27","date_gmt":"2023-08-14T10:26:27","guid":{"rendered":"https:\/\/entertainment.runfyers.com\/index.php\/2023\/08\/14\/the-new-york-times-says-you-cant-use-its-content-to-train-ai-models\/"},"modified":"2023-08-14T10:26:27","modified_gmt":"2023-08-14T10:26:27","slug":"the-new-york-times-says-you-cant-use-its-content-to-train-ai-models","status":"publish","type":"post","link":"https:\/\/entertainment.runfyers.com\/index.php\/2023\/08\/14\/the-new-york-times-says-you-cant-use-its-content-to-train-ai-models\/","title":{"rendered":"The New York Times says you can\u2019t use its content to train AI models"},"content":{"rendered":"<div>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\"><em>The New York Times<\/em> has taken preemptive measures to stop its content from being used to train artificial intelligence models. 
As reported by <a href=\"https:\/\/www.adweek.com\/media\/the-new-york-times-updates-terms-of-service-to-prevent-ai-scraping-its-content\/\" target=\"_blank\" rel=\"noopener\"><em>Adweek<\/em><\/a>, the <em>NYT<\/em> updated its <a href=\"https:\/\/help.nytimes.com\/hc\/en-us\/articles\/115014893428-Terms-of-Service\" target=\"_blank\" rel=\"noopener\">Terms of Service<\/a> on August 3rd to prohibit its content \u2014 inclusive of text, photographs, images, audio\/video clips, \u201clook and feel,\u201d metadata, or compilations \u2014 from being used in the development of \u201cany software program, including, but not limited to, training a machine learning or artificial intelligence (AI) system.\u201d<\/p>\n<\/div>\n<div>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">The updated terms now also specify that automated tools like website crawlers designed to use, access, or collect such content cannot be used without written permission from the publication. The <em>NYT<\/em> says that refusing to comply with these new restrictions could result in unspecified fines or penalties. 
Despite introducing the new rules to its policy, the publication doesn\u2019t appear to have made any changes to its <a href=\"https:\/\/www.nytimes.com\/robots.txt\" target=\"_blank\" rel=\"noopener\">robots.txt<\/a> \u2014 the file that informs search engine crawlers which URLs can be accessed.<\/p>\n<\/div>\n<div>\n<div class=\"duet--article--article-pullquote mb-20\">\n<p class=\"duet--article--dangerously-set-cms-markup relative bg-repeating-lines-dark bg-[length:1px_1.2em] pb-8 font-polysans text-28 font-medium leading-120 tracking-1 selection:bg-franklin-20  dark:bg-repeating-lines-light dark:text-white dark:selection:bg-blurple\">Google recently granted itself permission to train its AI services on public data it collects from the web.<\/p>\n<\/div>\n<\/div>\n<div>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">The move could be in response to a recent update to Google\u2019s privacy policy that discloses <a href=\"https:\/\/www.theverge.com\/2023\/7\/5\/23784257\/google-ai-bard-privacy-policy-train-web-scraping\" target=\"_blank\" rel=\"noopener\">the search giant may collect public data<\/a> from the web to train its various AI services, such as Bard or Cloud AI. 
Many large language models powering popular AI services like <a href=\"https:\/\/www.theverge.com\/2023\/3\/15\/23640180\/openai-gpt-4-launch-closed-research-ilya-sutskever-interview\" target=\"_blank\" rel=\"noopener\">OpenAI\u2019s ChatGPT<\/a> are trained on vast datasets that could contain copyrighted or otherwise protected materials scraped from the web without the original creator\u2019s permission.<\/p>\n<\/div>\n<div>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">That said, the <em>NYT<\/em> also signed a $100 million deal with Google <a href=\"https:\/\/www.nytco.com\/press\/the-new-york-times-company-and-google-expand-agreement-on-news-and-innovation\/\" target=\"_blank\" rel=\"noopener\">back in February<\/a> that allows the search giant to feature <em>Times<\/em> content across some of its platforms over the next three years. 
The publication said that both companies will work together on tools for content distribution, subscriptions, marketing, ads, and \u201cexperimentation,\u201d so it\u2019s possible that the changes to the <em>NYT<\/em> terms of service are directed at other companies like OpenAI or Microsoft.<\/p>\n<\/div>\n<div>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">OpenAI recently announced that website operators can now <a href=\"https:\/\/www.theverge.com\/2023\/8\/7\/23823046\/openai-data-scrape-block-ai\" target=\"_blank\" rel=\"noopener\">block its GPTBot web crawler<\/a> from scraping their websites. Microsoft also <a href=\"https:\/\/venturebeat.com\/ai\/microsoft-changes-services-agreement-to-add-restrictions-for-ai-offerings\/\" target=\"_blank\" rel=\"noopener\">added some new restrictions to its own<\/a> T&amp;Cs that ban people from using its AI products to \u201ccreate, train, or improve (directly or indirectly) any other AI service,\u201d alongside banning users from scraping or otherwise extracting data from its AI tools.<\/p>\n<\/div>\n<div>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">Earlier this month, several news organizations including <em>The Associated Press<\/em> and the <em>European Publishers\u2019 Council<\/em> <a href=\"https:\/\/www.theverge.com\/2023\/8\/10\/23827316\/news-transparency-copyright-generative-ai\" 
target=\"_blank\" rel=\"noopener\">signed an open letter<\/a> calling for global lawmakers to usher in rules that would require transparency into training datasets and consent of rights holders before using data for training.<\/p>\n<\/div>\n<p><br \/>\n<br \/><a href=\"https:\/\/www.theverge.com\/2023\/8\/14\/23831109\/the-new-york-times-ai-web-scraping-rules-terms-of-service\" target=\"_blank\" rel=\"noopener\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>The New York Times has taken preemptive measures to stop its content from being used to train artificial intelligence models. As reported by Adweek, the NYT updated its Terms of Service on August 3rd to prohibit its content \u2014 inclusive of text, photographs, images, audio\/video clips, \u201clook and feel,\u201d metadata, or compilations \u2014 from being [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":31720,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[14],"tags":[],"class_list":{"0":"post-31719","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-tech"},"_links":{"self":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/31719","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/comments?post=31719"}],"version-history":[{"count":0,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/31719\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/entertainment.runf
yers.com\/index.php\/wp-json\/wp\/v2\/media\/31720"}],"wp:attachment":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media?parent=31719"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/categories?post=31719"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/tags?post=31719"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}