July 17, 2025

    OpenAI launches a general purpose agent in ChatGPT | TechCrunch



    OpenAI is launching a new general purpose AI agent in ChatGPT, which the company says can complete a wide variety of computer-based tasks on behalf of users. OpenAI says the agent can automatically navigate a user’s calendar, generate editable presentations and slideshows, and run code.

    The tool, called ChatGPT agent, combines several capabilities from OpenAI’s previous agentic tools, including Operator’s ability to click around on websites, as well as Deep Research’s ability to synthesize information from dozens of websites into a concise research report. OpenAI says users will be able to interact with the agent simply by prompting ChatGPT in natural language.

    On Thursday, OpenAI is rolling out ChatGPT agent for subscribers to its Pro, Plus, and Team plans. To activate the tool, users can select “agent mode” in ChatGPT’s dropdown menu of tools.

    The launch of ChatGPT agent represents OpenAI’s boldest attempt yet to turn ChatGPT into an agentic product that can take actions and offload tasks for users, rather than just answering questions. In recent years, Silicon Valley companies including OpenAI, Google, and Perplexity have unveiled dozens of AI agents that promised to do just that. However, these early versions of AI agents have struggled with complex tasks, and as products they seem less compelling than the vision tech executives pitch for them.

    That said, OpenAI says ChatGPT agent is far more capable than its previous offerings.

    OpenAI’s new agent can access ChatGPT connectors, which let users connect apps like Gmail and GitHub so that the agent can find information relevant to their prompts. Furthermore, OpenAI says ChatGPT agent has access to a terminal and can use APIs to access certain apps.

    The model underlying ChatGPT agent offers state-of-the-art performance on several benchmarks, according to OpenAI.


    The company says the ChatGPT agent model scores 41.6% on Humanity’s Last Exam (pass@1), a difficult test made up of thousands of questions across more than one hundred subjects. That’s roughly double what OpenAI’s o3 and o4-mini scored on the test.

    On FrontierMath, one of the hardest known math benchmarks, OpenAI says ChatGPT agent scores 27.4% when it has access to tools, such as a terminal for code execution. The previous state-of-the-art score comes from o4-mini, which scored just 6.3%.

    OpenAI notes that it developed ChatGPT agent with safety in mind, largely because the product presents some newfound capabilities that could make it more dangerous in the hands of a bad actor. How capable ChatGPT agent truly is, however, remains to be seen.


