{"id":130174,"date":"2024-10-10T13:00:00","date_gmt":"2024-10-10T13:00:00","guid":{"rendered":"https:\/\/entertainment.runfyers.com\/index.php\/2024\/10\/10\/agents-are-the-future-ai-companies-promise-and-desperately-need\/"},"modified":"2024-10-10T13:00:00","modified_gmt":"2024-10-10T13:00:00","slug":"agents-are-the-future-ai-companies-promise-and-desperately-need","status":"publish","type":"post","link":"https:\/\/entertainment.runfyers.com\/index.php\/2024\/10\/10\/agents-are-the-future-ai-companies-promise-and-desperately-need\/","title":{"rendered":"Agents are the future AI companies promise \u2014 and desperately need"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<div class=\"duet--article--article-body-component\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">Humans have automated tasks for centuries. Now, AI companies see a path to profit in harnessing our love of efficiency, and they\u2019ve got a name for their solution: agents.\u00a0<\/p>\n<\/div>\n<div class=\"duet--article--article-body-component\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">AI agents are autonomous programs that perform tasks, make decisions, and interact with environments with little human input, and they\u2019re the focus of every major company working on AI today. Microsoft has \u201cCopilots\u201d designed to help businesses automate things like customer service and administrative tasks. Google Cloud CEO Thomas Kurian recently <a href=\"https:\/\/www.leewayhertz.com\/how-to-build-an-ai-agent\/\" target=\"_blank\" rel=\"noopener\">outlined a pitch for six different AI productivity agents<\/a>, and Google DeepMind <a href=\"https:\/\/x.com\/demishassabis\/status\/1841984103312208037\" target=\"_blank\">just poached OpenAI\u2019s co-lead on its AI video product<\/a>, Sora, to <a href=\"https:\/\/deepmind.google\/discover\/blog\/sima-generalist-ai-agent-for-3d-virtual-environments\/\" target=\"_blank\" rel=\"noopener\">work on developing a simulation for training AI agents<\/a>. Anthropic <a href=\"https:\/\/www.theverge.com\/2024\/5\/30\/24167231\/anthropic-claude-ai-assistant-automate-tasks\" target=\"_blank\" rel=\"noopener\">released a feature for its AI chatbot, Claude<\/a>, that will let anyone create their own \u201cAI assistant.\u201d OpenAI includes agents as level 2 in its 5-level approach to reach AGI, or human-level artificial intelligence.<\/p>\n<\/div>\n<div class=\"duet--article--article-body-component\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">Obviously, computing is full of autonomous systems. Many people have visited a website with a pop-up customer service bot, used an automated voice assistant feature like Alexa Skills, or written <a href=\"https:\/\/www.theverge.com\/2015\/2\/19\/8063877\/ifttt-do-camera-do-note-do-button-IF\" target=\"_blank\" rel=\"noopener\">a humble IFTTT script<\/a>. But AI companies argue \u201cagents\u201d \u2014 you\u2019d better not call them bots \u2014\u00a0are different. Instead of following a simple, rote set of instructions, they believe agents will be able to interact with environments, learn from feedback, and make decisions without constant human input. They could dynamically manage tasks like making purchases, booking travel, or scheduling meetings, adapting to unforeseen circumstances and interacting with systems that could include humans and other AI tools.<\/p>\n<\/div>\n<div class=\"duet--article--article-body-component\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">Artificial intelligence companies hope that agents will provide a way to monetize powerful, expensive AI models. Venture capital is pouring into AI agent startups that promise to revolutionize how we interact with technology. Businesses envision a leap in efficiency, with agents handling everything from customer service to data analysis. For individuals, AI companies are pitching a new era of productivity where routine tasks are automated, freeing up time for creative and strategic work. The endgame for true believers is to create AI that is a true partner, not just a tool.<\/p>\n<\/div>\n<div class=\"duet--article--article-body-component\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">\u201cWhat you really want,\u201d OpenAI CEO Sam Altman <a href=\"https:\/\/www.technologyreview.com\/2024\/05\/01\/1091979\/sam-altman-says-helpful-agents-are-poised-to-become-ais-killer-function\/\" target=\"_blank\" rel=\"noopener\">told <em>MIT Technology Review<\/em><\/a> earlier this year, \u201cis just this thing that is off helping you.\u201d Altman described the killer app for AI as a \u201csuper-competent colleague that knows absolutely everything about my whole life, every email, every conversation I\u2019ve ever had, but doesn\u2019t feel like an extension.\u201d It can tackle simple tasks instantly, Altman added, and for more complex ones, it will attempt them but return with questions if needed. Tech companies have been trying to automate the personal assistant since at least the 1970s, and now, they promise they\u2019re finally getting close.<\/p>\n<\/div>\n<div class=\"duet--article--article-body-component\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">At an OpenAI press event ahead of the company\u2019s annual Dev Day, head of developer experience Romain Huet demonstrated the company\u2019s new Realtime API with an assistant agent. Huet gave the agent a budget and some constraints for buying 400 chocolate-covered strawberries and asked it to place an order via a phone call to a fictitious shop.<\/p>\n<\/div>\n<div class=\"duet--article--article-body-component\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">The service is similar to a Google reservation-making bot called Duplex from 2018. But that bot could only handle the simplest scenarios \u2014 it turned out <a href=\"https:\/\/www.theverge.com\/2019\/5\/22\/18636138\/google-duplex-human-callers-25-percent-ai-restaurant-booking\" target=\"_blank\" rel=\"noopener\">a quarter of its calls were actually made by humans<\/a>.<\/p>\n<\/div>\n<div class=\"duet--article--article-body-component clear-both block md:float-left md:mr-30 md:w-[320px] lg:-ml-100\">\n<div class=\"duet--article--sidebar bg-gray-200 mb-20 w-full rounded-sm bg-[#F8F5FF] p-20 [&amp;&gt;*:last-child&gt;*:last-child]:mb-0\">\n<div class=\"[&amp;_p]:font-polysans [&amp;_p]:text-16 [&amp;_p]:font-light [&amp;_p]:leading-130\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\"><strong>Do you work at OpenAI?<\/strong> I\u2019d love to chat. You can reach me securely on Signal @kylie.01 or via email at kylie@theverge.com.<\/p>\n<\/div>\n<\/div>\n<\/div>\n<div class=\"duet--article--article-body-component\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">While that order was placed in English, Huet told me he gave a more complex demo in Tokyo: he prompted an agent to book a hotel room for him in Japanese where it would handle the conversation in Japanese and then call him back in English to confirm it\u2019s done. \u201cOf course, I wouldn\u2019t understand the Japanese part \u2014 it just handles it,\u201d Huet said.<\/p>\n<\/div>\n<div class=\"duet--article--article-body-component\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">But Huet\u2019s demo immediately sparked concerns in the room full of journalists. Couldn\u2019t the AI assistant be used for spam calls? Why didn\u2019t it identify itself as an AI system? (Huet updated the demo for the official Dev Day, an attendee says, making the agent identify itself as \u201cRomain\u2019s AI Assistant.\u201d) The unease was palpable, and it wasn\u2019t surprising \u2014\u00a0even without agents, AI tools are <a href=\"https:\/\/www.technologyreview.com\/2024\/05\/10\/1092293\/ai-systems-are-getting-better-at-tricking-us\/\" target=\"_blank\" rel=\"noopener\">already being used for deception<\/a>.<\/p>\n<\/div>\n<div class=\"duet--article--article-body-component\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">There was another, arguably more immediate problem: the demo didn\u2019t work. The agent lacked enough information and incorrectly recorded dessert flavors, causing it to auto-populate flavors like vanilla and strawberry in a column, <a href=\"https:\/\/www.theverge.com\/2024\/9\/17\/24243884\/openai-o1-model-research-safety-alignment\" target=\"_blank\" rel=\"noopener\">rather than saying it didn\u2019t have that information<\/a>. Agents frequently run into issues with multi-step workflows or unexpected scenarios. And they burn more energy than a conventional bot or voice assistant. Their need for significant computational power, especially when reasoning or interacting with multiple systems, makes them costly to run at scale.<\/p>\n<\/div>\n<div class=\"duet--article--article-body-component\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">AI agents offer a leap in <em>potential<\/em>, but for everyday tasks, they aren\u2019t yet significantly better than bots, assistants, or scripts. OpenAI and other labs aim to enhance their reasoning through reinforcement learning, all while <em>hoping<\/em> <a href=\"https:\/\/www.investopedia.com\/terms\/m\/mooreslaw.asp#:~:text=Moore&#039;s%20Law%20states%20that%20the%20number%20of%20transistors%20on%20a,became%20known%20as%20Moore&#039;s%20Law.\" target=\"_blank\" rel=\"noopener\">Moore\u2019s Law continues<\/a> to deliver cheaper, more powerful computing.<\/p>\n<\/div>\n<div class=\"duet--article--article-body-component\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">So, if AI agents aren\u2019t yet very useful, why is the idea so popular? In short: market pressures. These companies are sitting on powerful but expensive technology and are desperate to find practical use cases that they can <em>also<\/em> charge users for. The gap between promise and reality also creates a compelling hype cycle that fuels funding, and it just so happens that OpenAI <a href=\"https:\/\/www.theverge.com\/2024\/10\/2\/24260457\/openai-funding-round-thrive-capital-6-billion\" target=\"_blank\" rel=\"noopener\">raised $6.6 billion<\/a> right as it started hyping agents.<\/p>\n<\/div>\n<div class=\"duet--article--article-body-component clear-both block md:float-left md:mr-30 md:w-[320px] lg:-ml-100\">\n<div class=\"duet--article--article-pullquote mb-20\">\n<p class=\"duet--article--dangerously-set-cms-markup relative bg-repeating-lines-dark bg-[length:1px_1.2em] pb-8 font-polysans text-28 font-medium leading-120 tracking-1 selection:bg-franklin-20  dark:bg-repeating-lines-light dark:text-white dark:selection:bg-blurple\">AI agent startups have secured $8.2 billion in investor funding over the last 12 months<\/p>\n<\/div>\n<\/div>\n<div class=\"duet--article--article-body-component\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">Big tech companies have been rushing to integrate all kinds of \u201cAI\u201d into their products, but they hope AI assistants in particular could be the key to unlocking revenue. Huet\u2019s AI calling demo outpaces what models can currently do at scale, but he told me he expects features like it to appear more commonly as soon as next year, as OpenAI refines its \u201creasoning\u201d o1 model.<\/p>\n<\/div>\n<div class=\"duet--article--article-body-component\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">For now, the concept seems to be mostly siloed in enterprise software stacks, not products for consumers. Salesforce, which provides customer relationship management (CRM) software, spun up an \u201cagent\u201d feature to great fanfare a few weeks ahead of its annual Dreamforce conference. The feature lets customers use natural language to essentially build a customer service chatbot in a few minutes through Slack, instead of spending a lot of time coding one. The chatbots have access to a company\u2019s CRM data and can process natural language easier than a bot not based on large language models, potentially making them better at limited tasks like asking questions about orders and returns.<\/p>\n<\/div>\n<div class=\"duet--article--article-body-component\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">AI agent startups (still an admittedly nebulous term) are already becoming quite a buzzy investment. They\u2019ve secured $8.2 billion in investor funding over the last 12 months, spread over 156 deals, an increase of 81.4 percent year over year, <a href=\"https:\/\/fortune.com\/2024\/09\/24\/ai-agent-startups-deal-count-up-81-percent-year-over-year-pitchbook\/\" target=\"_blank\" rel=\"noopener\">according to PitchBook data<\/a>. One of the better-known projects is Sierra, a customer service agent similar to Salesforce\u2019s latest project and <a href=\"https:\/\/fortune.com\/2024\/02\/13\/bret-taylor-clay-bavor-ai-startup-sierra-110-million-funding-sequoia-benchmark\/\" target=\"_blank\" rel=\"noopener\">launched by former Salesforce co-CEO Bret Taylor<\/a>. There\u2019s also Harvey, which offers AI agents for lawyers, and TaxGPT, an AI agent to handle your taxes.<\/p>\n<\/div>\n<div class=\"duet--article--article-body-component\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">Despite all the enthusiasm for agents, these high-stakes uses raise a clear question: can they actually be trusted with something as serious as law or taxes? AI hallucinations, which have frequently tripped up users of ChatGPT, currently have no remedy in sight. More fundamentally, as <a href=\"https:\/\/images.app.goo.gl\/oh2uFMXQtyNs8v9f6\" target=\"_blank\" rel=\"noopener\">IBM presciently stated in 1979<\/a>, \u201ca computer can never be held accountable\u201d \u2014 and as a corollary, \u201ca computer must never make a management decision.\u201d Rather than autonomous decision-makers, AI assistants are best viewed as what they truly are: powerful but imperfect tools for low-stakes tasks. Is that worth the big bucks AI companies hope people will pay?<\/p>\n<\/div>\n<div class=\"duet--article--article-body-component\">\n<p class=\"duet--article--dangerously-set-cms-markup duet--article--standard-paragraph mb-20 font-fkroman text-18 leading-160 -tracking-1 selection:bg-franklin-20 dark:text-white dark:selection:bg-blurple [&amp;_a:hover]:shadow-highlight-franklin dark:[&amp;_a:hover]:shadow-highlight-blurple [&amp;_a]:shadow-underline-black dark:[&amp;_a]:shadow-underline-white\">For now, market pressures prevail, and AI companies are racing to monetize. \u201cI think 2025 is going to be the year that agentic systems finally hit the mainstream,\u201d OpenAI\u2019s new chief product officer, Kevin Weil, said at the press event. \u201cAnd if we do it right, it takes us to a world where we actually get to spend more time on the human things that matter, and a little less time staring at our phones.\u201d<\/p>\n<\/div>\n<\/div>\n<p><br \/>\n<br \/><a href=\"https:\/\/www.theverge.com\/2024\/10\/10\/24266333\/ai-agents-assistants-openai-google-deepmind-bots\" target=\"_blank\" rel=\"noopener\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Humans have automated tasks for centuries. Now, AI companies see a path to profit in harnessing our love of efficiency, and they\u2019ve got a name for their solution: agents.\u00a0 AI agents are autonomous programs that perform tasks, make decisions, and interact with environments with little human input, and they\u2019re the focus of every major company [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":130175,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[14],"tags":[],"class_list":{"0":"post-130174","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-tech"},"_links":{"self":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/130174","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/comments?post=130174"}],"version-history":[{"count":0,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/130174\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media\/130175"}],"wp:attachment":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media?parent=130174"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/categories?post=130174"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/tags?post=130174"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}