{"id":117157,"date":"2024-08-08T20:04:08","date_gmt":"2024-08-08T20:04:08","guid":{"rendered":"https:\/\/entertainment.runfyers.com\/index.php\/2024\/08\/08\/openai-says-its-latest-gpt-4o-model-is-medium-risk\/"},"modified":"2024-08-08T20:04:08","modified_gmt":"2024-08-08T20:04:08","slug":"openai-says-its-latest-gpt-4o-model-is-medium-risk","status":"publish","type":"post","link":"https:\/\/entertainment.runfyers.com\/index.php\/2024\/08\/08\/openai-says-its-latest-gpt-4o-model-is-medium-risk\/","title":{"rendered":"OpenAI says its latest GPT-4o model is \u2018medium\u2019 risk"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">OpenAI has released its <a href=\"https:\/\/openai.com\/index\/gpt-4o-system-card\/\" target=\"_blank\" rel=\"noopener\">GPT-4o System Card<\/a>, a research document that outlines the safety measures and risk evaluations the startup conducted before releasing its latest model.<\/p>\n<\/div>\n<div>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">GPT-4o <a href=\"https:\/\/www.theverge.com\/2024\/5\/13\/24155493\/openai-gpt-4o-launching-free-for-all-chatgpt-users\" target=\"_blank\" rel=\"noopener\">was launched publicly in May<\/a> of this year. Before its debut, OpenAI used an external group of red teamers, or security experts trying to find weaknesses in a system, to find key risks in the model (which is a fairly standard practice). They examined risks like the possibility that GPT-4o would create unauthorized clones of someone\u2019s voice, erotic and violent content, or chunks of reproduced copyrighted audio. Now, the results are being released.<\/p>\n<\/div>\n<div>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">According to OpenAI\u2019s own framework, the researchers found GPT-4o to be of \u201cmedium\u201d risk. The overall risk level was taken from the highest risk rating of four overall categories: cybersecurity, <a href=\"https:\/\/openai.com\/index\/building-an-early-warning-system-for-llm-aided-biological-threat-creation\/\" target=\"_blank\" rel=\"noopener\">biological threats<\/a>, persuasion, and model autonomy. All of these were deemed low risk except persuasion, where the researchers found some writing samples from GPT-4o could be better at swaying readers\u2019 opinions than human-written text \u2014 although the model\u2019s samples weren\u2019t more persuasive overall.<\/p>\n<\/div>\n<div>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">An OpenAI spokesperson, Lindsay McCallum R\u00e9my, told <em>The Verge<\/em> that the system card includes preparedness evaluations created by an internal team, alongside external testers <a href=\"https:\/\/openai.com\/index\/gpt-4o-system-card\/external-testers-acknowledgements\/\" target=\"_blank\" rel=\"noopener\">listed on OpenAI\u2019s website<\/a> as Model Evaluation and Threat Research (METR) and Apollo Research, both of which build evaluations for AI systems.<\/p>\n<\/div>\n<div>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">Moreover, the company is releasing a highly capable multimodal model just ahead of a US presidential election. There\u2019s a clear potential risk of the model accidentally spreading misinformation or getting hijacked by malicious actors \u2014 even if OpenAI is hoping to highlight that the company is testing real-world scenarios to prevent misuse.<\/p>\n<\/div>\n<div>\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">There have been plenty of calls for OpenAI to be more transparent, not just with the model\u2019s training data (<a href=\"https:\/\/www.theverge.com\/2024\/4\/6\/24122915\/openai-youtube-transcripts-gpt-4-training-data-google\" target=\"_blank\" rel=\"noopener\">is it trained on YouTube?<\/a>), but with its safety testing. In California, where OpenAI and many other leading AI labs are based, state Sen. Scott Wiener is working to pass a bill to regulate large language models, including restrictions that would hold companies legally accountable if their AI is used in harmful ways. If that bill is passed, OpenAI\u2019s frontier models would have to comply with state-mandated risk assessments before making models available for public use. But the biggest takeaway from the GPT-4o System Card is that, despite the group of external red teamers and testers, a lot of this relies on OpenAI to evaluate itself.<\/p>\n<\/div>\n<p><br \/>\n<br \/><a href=\"https:\/\/www.theverge.com\/2024\/8\/8\/24216193\/openai-safety-assessment-gpt-4o\" target=\"_blank\" rel=\"noopener\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>OpenAI has released its GPT-4o System Card, a research document that outlines the safety measures and risk evaluations the startup conducted before releasing its latest model. GPT-4o was launched publicly in May of this year. Before its debut, OpenAI used an external group of red teamers, or security experts trying to find weaknesses in a [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":117158,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[14],"tags":[],"class_list":{"0":"post-117157","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-tech"},"_links":{"self":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/117157","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/comments?post=117157"}],"version-history":[{"count":0,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/117157\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media\/117158"}],"wp:attachment":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media?parent=117157"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/categories?post=117157"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/tags?post=117157"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}