{"id":162779,"date":"2025-04-17T20:40:19","date_gmt":"2025-04-17T20:40:19","guid":{"rendered":"https:\/\/entertainment.runfyers.com\/index.php\/2025\/04\/17\/openai-launches-flex-processing-for-cheaper-slower-ai-tasks-techcrunch\/"},"modified":"2025-04-17T20:40:19","modified_gmt":"2025-04-17T20:40:19","slug":"openai-launches-flex-processing-for-cheaper-slower-ai-tasks-techcrunch","status":"publish","type":"post","link":"https:\/\/entertainment.runfyers.com\/index.php\/2025\/04\/17\/openai-launches-flex-processing-for-cheaper-slower-ai-tasks-techcrunch\/","title":{"rendered":"OpenAI launches Flex processing for cheaper, slower AI tasks | TechCrunch"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<p id=\"speakable-summary\" class=\"wp-block-paragraph\">In a bid to more aggressively compete with rival AI companies like Google, OpenAI is launching <a href=\"https:\/\/platform.openai.com\/docs\/guides\/flex-processing\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Flex processing<\/a>, an API option that provides lower AI model usage prices in exchange for slower response times and \u201coccasional resource unavailability.\u201d<\/p>\n<p class=\"wp-block-paragraph\">Flex processing, which is available in beta for OpenAI\u2019s recently released <a href=\"https:\/\/techcrunch.com\/2025\/04\/16\/openai-launches-a-pair-of-ai-reasoning-models-o3-and-o4-mini\/\" target=\"_blank\" rel=\"noopener\">o3 and o4-mini<\/a> reasoning models, is aimed at lower-priority and \u201cnon-production\u201d tasks such as model evaluations, data enrichment, and asynchronous workloads, OpenAI says.<\/p>\n<p class=\"wp-block-paragraph\">It reduces API costs by exactly half. For o3, Flex processing is $5\/M input tokens (~750,000 words) and $20\/M output tokens versus the standard $10\/M input tokens and $40\/M output tokens. For o4-mini, Flex brings the price down to $0.55\/M input tokens and $2.20\/M output tokens from $1.10\/M input tokens and $4.40\/M output tokens.<\/p>\n<p class=\"wp-block-paragraph\">The launch of Flex processing comes as the <a href=\"https:\/\/techcrunch.com\/2025\/04\/10\/the-rise-of-ai-reasoning-models-is-making-benchmarking-more-expensive\/\" target=\"_blank\" rel=\"noopener\">price of frontier AI continues to climb<\/a>, and as rivals release cheaper, more efficient budget-oriented models. On Thursday, Google rolled out <a href=\"https:\/\/blog.google\/products\/gemini\/gemini-2-5-flash-preview\/\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">Gemini 2.5 Flash<\/a>, a reasoning model that matches or bests <a href=\"https:\/\/techcrunch.com\/2025\/01\/27\/deepseek-claims-its-reasoning-model-beats-openais-o1-on-certain-benchmarks\/\" target=\"_blank\" rel=\"noopener\">DeepSeek\u2019s R1<\/a> in terms of performance at a lower input token cost.<\/p>\n<p class=\"wp-block-paragraph\">In an <a href=\"https:\/\/x.com\/btibor91\/status\/1912958820437148062\" target=\"_blank\" rel=\"noreferrer noopener nofollow\">email to customers<\/a> announcing the launch of Flex pricing, OpenAI also indicated that developers in tiers 1-3 of its usage tiers hierarchy will have to complete the <a href=\"https:\/\/techcrunch.com\/2025\/04\/13\/access-to-future-ai-models-in-openais-api-may-require-a-verified-id\/\" target=\"_blank\" rel=\"noopener\">newly introduced ID verification process<\/a> to access o3. (Tiers are determined by the amount of money spent on OpenAI services.) O3\u2019s \u2014 and other models\u2019 \u2014reasoning summaries and streaming API support are also gated behind verification.<\/p>\n<p class=\"wp-block-paragraph\">OpenAI previously said ID verification is intended to stop bad actors from violating its usage policies.<\/p>\n<\/div>\n<p><br \/>\n<br \/><a href=\"https:\/\/techcrunch.com\/2025\/04\/17\/openai-launches-flex-processing-for-cheaper-slower-ai-tasks\/\" target=\"_blank\" rel=\"noopener\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>In a bid to more aggressively compete with rival AI companies like Google, OpenAI is launching Flex processing, an API option that provides lower AI model usage prices in exchange for slower response times and \u201coccasional resource unavailability.\u201d Flex processing, which is available in beta for OpenAI\u2019s recently released o3 and o4-mini reasoning models, is [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":162780,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[14],"tags":[],"class_list":{"0":"post-162779","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-tech"},"_links":{"self":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/162779","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/comments?post=162779"}],"version-history":[{"count":0,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/162779\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media\/162780"}],"wp:attachment":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media?parent=162779"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/categories?post=162779"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/tags?post=162779"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}