{"id":95890,"date":"2024-05-08T19:52:00","date_gmt":"2024-05-08T19:52:00","guid":{"rendered":"https:\/\/entertainment.runfyers.com\/index.php\/2024\/05\/08\/openai-offers-a-peek-behind-the-curtain-of-its-ais-secret-instructions-techcrunch\/"},"modified":"2024-05-08T19:52:00","modified_gmt":"2024-05-08T19:52:00","slug":"openai-offers-a-peek-behind-the-curtain-of-its-ais-secret-instructions-techcrunch","status":"publish","type":"post","link":"https:\/\/entertainment.runfyers.com\/index.php\/2024\/05\/08\/openai-offers-a-peek-behind-the-curtain-of-its-ais-secret-instructions-techcrunch\/","title":{"rendered":"OpenAI offers a peek behind the curtain of its AI&#8217;s secret instructions | TechCrunch"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<p id=\"speakable-summary\" class=\"wp-block-paragraph\">Ever wonder why conversational AI like ChatGPT says \u201cSorry, I can\u2019t do that\u201d or some other polite refusal? OpenAI is offering a limited look at the reasoning behind its own models\u2019 rules of engagement, whether it\u2019s sticking to brand guidelines or declining to make NSFW content.<\/p>\n<p class=\"wp-block-paragraph\">Large language models (LLMs) don\u2019t have any naturally occurring limits on what they can or will say. That\u2019s part of why they\u2019re so versatile, but also why they hallucinate and are easily duped.<\/p>\n<p class=\"wp-block-paragraph\">It\u2019s necessary for any AI model that interacts with the general public <a href=\"https:\/\/techcrunch.com\/2024\/02\/23\/embarrassing-and-wrong-google-admits-it-lost-control-of-image-generating-ai\/\" target=\"_blank\" rel=\"noopener\">to have a few guardrails<\/a> on what it should and shouldn\u2019t do, but defining these \u2014 let alone enforcing them \u2014 is a surprisingly difficult task.<\/p>\n<p class=\"wp-block-paragraph\">If someone asks an AI to generate a bunch of false claims about a public figure, it should refuse, right? But what if they\u2019re an AI developer themselves, creating a database of synthetic disinformation for a detector model?<\/p>\n<p class=\"wp-block-paragraph\">What if someone asks for laptop recommendations; it should be objective, right? But what if the model is being deployed by a laptop maker who wants it to only respond with their own devices?<\/p>\n<p class=\"wp-block-paragraph\">AI makers are all navigating conundrums like these and looking for efficient methods to rein in their models without causing them to refuse perfectly normal requests. But they seldom share exactly how they do it.<\/p>\n<p class=\"wp-block-paragraph\">OpenAI is bucking the trend a bit by publishing what it calls its \u201cmodel spec,\u201d a collection of high-level rules that indirectly govern ChatGPT and other models.<\/p>\n<p class=\"wp-block-paragraph\">There are meta-level objectives, some hard rules and some general behavior guidelines, though to be clear these are not strictly speaking what the model is primed with; OpenAI will have developed specific instructions that accomplish what these rules describe in natural language.<\/p>\n<p class=\"wp-block-paragraph\">It\u2019s an interesting look at how a company sets its priorities and handles edge cases. And there are <a href=\"https:\/\/cdn.openai.com\/spec\/model-spec-2024-05-08.html\" target=\"_blank\" rel=\"noreferrer noopener\">numerous examples of how they might play out<\/a>.<\/p>\n<p class=\"wp-block-paragraph\">For instance, OpenAI states clearly that the developer intent is basically the highest law. So one version of a chatbot running GPT-4 might provide the answer to a math problem when asked for it. But if that chatbot has been primed by its developer to never simply provide an answer straight out, it will instead offer to work through the solution step by step:<\/p>\n<figure class=\"wp-block-image size-large\"><figcaption class=\"wp-element-caption\"><strong>Image Credits:<\/strong> OpenAI<\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">A conversational interface might even decline to talk about anything not approved, in order to nip any manipulation attempts in the bud. Why even let a cooking assistant weigh in on U.S. involvement in the Vietnam War? Why should a customer service chatbot agree to help with your erotic supernatural novella work in progress? Shut it down.<\/p>\n<p class=\"wp-block-paragraph\">It also gets sticky in matters of privacy, like asking for someone\u2019s name and phone number. As OpenAI points out, obviously a public figure like a mayor or member of Congress should have their contact details provided, but what about tradespeople in the area? That\u2019s probably OK \u2014 but what about employees of a certain company, or members of a political party? Probably not.<\/p>\n<p class=\"wp-block-paragraph\">Choosing when and where to draw the line isn\u2019t simple. Nor is creating the instructions that cause the AI to adhere to the resulting policy. And no doubt these policies will fail all the time as people learn to circumvent them or accidentally find edge cases that aren\u2019t accounted for.<\/p>\n<p class=\"wp-block-paragraph\">OpenAI isn\u2019t showing its whole hand here, but it\u2019s helpful to users and developers to see how these rules and guidelines are set and why, set out clearly if not necessarily comprehensively.<\/p>\n<\/div>\n<p><br \/>\n<br \/><a href=\"https:\/\/techcrunch.com\/2024\/05\/08\/openai-offers-a-peek-behind-the-curtain-of-its-ais-secret-instructions\/\" target=\"_blank\" rel=\"noopener\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Ever wonder why conversational AI like ChatGPT says \u201cSorry, I can\u2019t do that\u201d or some other polite refusal? OpenAI is offering a limited look at the reasoning behind its own models\u2019 rules of engagement, whether it\u2019s sticking to brand guidelines or declining to make NSFW content. Large language models (LLMs) don\u2019t have any naturally occurring [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":95891,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[14],"tags":[],"class_list":{"0":"post-95890","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-tech"},"_links":{"self":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/95890","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/comments?post=95890"}],"version-history":[{"count":0,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/95890\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media\/95891"}],"wp:attachment":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media?parent=95890"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/categories?post=95890"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/tags?post=95890"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}