{"id":185148,"date":"2025-08-05T14:10:14","date_gmt":"2025-08-05T14:10:14","guid":{"rendered":"https:\/\/entertainment.runfyers.com\/index.php\/2025\/08\/05\/deepmind-thinks-its-new-genie-3-world-model-presents-a-stepping-stone-toward-agi-techcrunch\/"},"modified":"2025-08-05T14:10:14","modified_gmt":"2025-08-05T14:10:14","slug":"deepmind-thinks-its-new-genie-3-world-model-presents-a-stepping-stone-toward-agi-techcrunch","status":"publish","type":"post","link":"https:\/\/entertainment.runfyers.com\/index.php\/2025\/08\/05\/deepmind-thinks-its-new-genie-3-world-model-presents-a-stepping-stone-toward-agi-techcrunch\/","title":{"rendered":"DeepMind thinks its new Genie 3 world model presents a stepping stone toward AGI | TechCrunch"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<p id=\"speakable-summary\" class=\"wp-block-paragraph\">Google DeepMind has revealed Genie 3, its latest foundation world model that can be used to train general-purpose AI agents, a capability that the AI lab says makes for a crucial stepping stone on the path to \u201cartificial general intelligence,\u201d or human-like intelligence.\u00a0<\/p>\n<p class=\"wp-block-paragraph\">\u201cGenie 3 is the first real-time interactive general-purpose world model,\u201d Shlomi Fruchter, a research director at DeepMind, said during a press briefing. \u201cIt goes beyond narrow world models that existed before. It\u2019s not specific to any particular environment. It can generate both photo-realistic and imaginary worlds, and everything in between.\u201d<\/p>\n<p class=\"wp-block-paragraph\">Still in research preview and not publicly available, Genie 3 builds on both its predecessor <a href=\"https:\/\/techcrunch.com\/2024\/12\/04\/deepminds-genie-2-can-generate-interactive-worlds-that-look-like-video-games\/\" target=\"_blank\" rel=\"noopener\">Genie 2<\/a> (which can generate new environments for agents) and DeepMind\u2019s latest video generation model <a href=\"https:\/\/techcrunch.com\/2025\/07\/03\/google-rolls-out-its-new-veo-3-video-generation-model-globally\/\" target=\"_blank\" rel=\"noopener\">Veo 3<\/a> (which is said to have a deep understanding of physics).\u00a0<\/p>\n<figure class=\"wp-block-image aligncenter size-large\"><figcaption class=\"wp-element-caption\"><span class=\"wp-block-image__credits\"><strong>Image Credits:<\/strong>Google DeepMind<\/span><\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">With a simple text prompt, Genie 3 can generate multiple minutes of interactive 3D environments at 720p resolution at 24 frames per second \u2014 a significant jump from the 10 to 20 seconds Genie 2 could produce. The model also features \u201cpromptable world events,\u201d or the ability to use a prompt to change the generated world.<\/p>\n<p class=\"wp-block-paragraph\">Perhaps most importantly, Genie 3\u2019s simulations stay physically consistent over time because the model can remember what it previously generated \u2014 a capability that DeepMind says its researchers didn\u2019t explicitly program into the model.\u00a0<\/p>\n<p class=\"wp-block-paragraph\">Fruchter said that while Genie 3 has implications for educational experiences, <a href=\"https:\/\/techcrunch.com\/2025\/07\/02\/could-googles-veo-3-be-the-start-of-playable-world-models\/\" target=\"_blank\" rel=\"noopener\">gaming<\/a> or prototyping creative concepts, its real unlock will manifest in training agents for general-purpose tasks, which he said is essential to reaching AGI.\u00a0<\/p>\n<p class=\"wp-block-paragraph\">\u201cWe think world models are key on the path to AGI, specifically for embodied agents, where simulating real world scenarios is particularly challenging,\u201d Jack Parker-Holder, a research scientist on DeepMind\u2019s open-endedness team, said during the briefing.<\/p>\n<div class=\"wp-block-techcrunch-inline-cta\">\n<div class=\"inline-cta__wrapper\">\n<p>Techcrunch event<\/p>\n<div class=\"inline-cta__content\">\n<p>\n\t\t\t\t\t\t\t\t\t<span class=\"inline-cta__location\">San Francisco<\/span><br \/>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<span class=\"inline-cta__separator\">|<\/span><br \/>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<span class=\"inline-cta__date\">October 27-29, 2025<\/span>\n\t\t\t\t\t\t\t<\/p>\n<\/p><\/div>\n<\/p><\/div>\n<\/div>\n<figure class=\"wp-block-image aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" height=\"383\" width=\"680\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/Prompt-to-World.gif?w=680\" alt=\"\" class=\"wp-image-3034136\"\/><figcaption class=\"wp-element-caption\"><span class=\"wp-block-image__credits\"><strong>Image Credits:<\/strong>Google DeepMind<\/span><\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">Genie 3 is supposedly designed to solve that bottleneck. Like Veo, it doesn\u2019t rely on a hard-coded physics engine; instead, DeepMind says, the model teaches itself how the world works \u2014 how objects move, fall, and interact \u2014 by remembering what it has generated and reasoning over long time horizons.\u00a0<\/p>\n<p class=\"wp-block-paragraph\">\u201cThe model is auto-regressive, meaning it generates one frame at a time,\u201d Fruchter told TechCrunch in an interview. \u201cIt has to look back at what was generated before to decide what\u2019s going to happen next. That\u2019s a key part of the architecture.\u201d<\/p>\n<p class=\"wp-block-paragraph\">That memory, the company says, lends to consistency in Genie 3\u2019s simulated worlds, which in turn allows it to develop a grasp of physics, similar to how humans understand that a glass teetering on the edge of a table is about to fall, or that they should duck to avoid a falling object.<\/p>\n<p class=\"wp-block-paragraph\">Notably, DeepMind says the model also has the potential to push AI agents to their limits \u2014 forcing them to learn from their own experience, similar to how humans learn in the real world.<\/p>\n<p class=\"wp-block-paragraph\">As an example, DeepMind shared its test of Genie 3 with a recent version of its generalist <a href=\"https:\/\/deepmind.google\/discover\/blog\/sima-generalist-ai-agent-for-3d-virtual-environments\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Scalable Instructable Multiworld Agent (SIMA)<\/a>, instructing it to pursue a set of goals. In a warehouse setting, they asked the agent to perform tasks like \u201capproach the bright green trash compactor\u201d or \u201cwalk to the packed red forklift.\u201d<\/p>\n<p class=\"wp-block-paragraph\">\u201cIn all three cases, the SIMA agent is able to achieve the goal,\u201d Parker-Holder said. \u201cIt just receives the actions from the agent. So the agent takes the goal, sees the world simulated around it, and then takes the actions in the world. Genie 3 simulates forward, and the fact that it\u2019s able to achieve it is because Genie 3 remains consistent.\u201d\u00a0<\/p>\n<figure class=\"wp-block-image aligncenter size-large is-resized\"><img loading=\"lazy\" decoding=\"async\" width=\"480\" height=\"270\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2025\/08\/Prompt-Event.gif?w=480\" alt=\"\" class=\"wp-image-3034137\" style=\"width:750px;height:auto\"\/><figcaption class=\"wp-element-caption\"><span class=\"wp-block-image__credits\"><strong>Image Credits:<\/strong>Google DeepMind<\/span><\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">That said, Genie 3 has its limitations. For example, while the researchers claim it can understand physics, the demo showing a skier barreling down a mountain didn\u2019t reflect how snow would move in relation to the skier. <\/p>\n<p class=\"wp-block-paragraph\">Additionally, the range of actions an agent can take is limited. For example, the promptable world events allow for a wide range of environmental interventions, but they\u2019re not necessarily performed by the agent itself. And it\u2019s still difficult to accurately model complex interactions between multiple independent agents in a shared environment.<\/p>\n<p class=\"wp-block-paragraph\">Genie 3 can also only support a few minutes of continuous interaction, when hours would be necessary for proper training.\u00a0<\/p>\n<p class=\"wp-block-paragraph\">Still, the model presents a compelling step forward in teaching agents to go beyond reacting to inputs, letting them potentially plan, explore, seek out uncertainty, and improve through trial and error \u2014 the kind of self-driven, embodied learning that many say is key to moving toward general intelligence.\u00a0<\/p>\n<p class=\"wp-block-paragraph\">\u201cWe haven\u2019t really had a Move 37 moment for embodied agents yet, where they can actually take novel actions in the real world,\u201d Parker-Holder said, referring to the legendary moment in the 2016 game of Go between DeepMind\u2019s AI agent AlphaGo and world champion Lee Sedol, in which Alpha Go played an unconventional and brilliant move that became symbolic of AI\u2019s ability to discover new strategies beyond human understanding.\u00a0<\/p>\n<p class=\"wp-block-paragraph\">\u201cBut now, we can potentially usher in a new era,\u201d he said.\u00a0<\/p>\n<\/div>\n<p><br \/>\n<br \/><a href=\"https:\/\/techcrunch.com\/2025\/08\/05\/deepmind-thinks-genie-3-world-model-presents-stepping-stone-towards-agi\/\" target=\"_blank\" rel=\"noopener\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Google DeepMind has revealed Genie 3, its latest foundation world model that can be used to train general-purpose AI agents, a capability that the AI lab says makes for a crucial stepping stone on the path to \u201cartificial general intelligence,\u201d or human-like intelligence.\u00a0 \u201cGenie 3 is the first real-time interactive general-purpose world model,\u201d Shlomi Fruchter, [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":185149,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[14],"tags":[],"class_list":{"0":"post-185148","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-tech"},"_links":{"self":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/185148","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/comments?post=185148"}],"version-history":[{"count":0,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/185148\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media\/185149"}],"wp:attachment":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media?parent=185148"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/categories?post=185148"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/tags?post=185148"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}