{"id":230801,"date":"2026-03-26T11:30:00","date_gmt":"2026-03-26T11:30:00","guid":{"rendered":"https:\/\/entertainment.runfyers.com\/index.php\/2026\/03\/26\/mistral-releases-a-new-open-source-model-for-speech-generation-techcrunch\/"},"modified":"2026-03-26T11:30:00","modified_gmt":"2026-03-26T11:30:00","slug":"mistral-releases-a-new-open-source-model-for-speech-generation-techcrunch","status":"publish","type":"post","link":"https:\/\/entertainment.runfyers.com\/index.php\/2026\/03\/26\/mistral-releases-a-new-open-source-model-for-speech-generation-techcrunch\/","title":{"rendered":"Mistral releases a new open-source model for speech generation | TechCrunch"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<p id=\"speakable-summary\" class=\"wp-block-paragraph\">French AI company Mistral released a new open-source text-to-speech model on Thursday that can be used by voice AI assistants or in enterprise use cases like customer support. The model, which lets enterprises build voice agents for sales and customer engagement, puts Mistral in direct competition with the likes of ElevenLabs, Deepgram, and OpenAI.<\/p>\n<p class=\"wp-block-paragraph\">The new model, called Voxtral TTS, supports nine languages, including English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic. <\/p>\n<p class=\"wp-block-paragraph\">\u201cOur customers have been asking for a speech model. So we built a small-sized speech model that can fit on a smartwatch, a smartphone, a laptop, or other edge devices. 
The cost of it is a fraction of anything else on the market, but it offers state-of-the-art performance,\u201d Pierre Stock, vp of science operations at Mistral AI, told TechCrunch during a phone interview.<\/p>\n<figure class=\"wp-block-image aligncenter size-large\"><figcaption class=\"wp-element-caption\"><span class=\"wp-element-caption__text\">Image Credits: Mistral<\/span><\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">Mistral said the new model can adapt a custom voice with a sample of less than five seconds, and also capture characteristics like subtle accents, inflections, intonations, and irregularities in the flow of speech. The model, based on <a rel=\"nofollow noopener\" href=\"https:\/\/docs.mistral.ai\/models\/ministral-3-3b-25-12\" target=\"_blank\">Ministral 3B<\/a>, can switch between languages easily without losing the characteristics of the voice, which is useful for use cases like dubbing or real-time translation. Stock said the company wanted the model to sound human and not robotic.<\/p>\n<figure class=\"wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\">\n<p>\n<iframe loading=\"lazy\" title=\"Voxtral TTS. Find your voice.\" width=\"696\" height=\"392\" src=\"https:\/\/www.youtube.com\/embed\/_N-ZGjGSVls?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe>\n<\/p>\n<\/figure>\n<p class=\"wp-block-paragraph\">The model has been built for real-time performance, according to the company. It has a time-to-first-audio (TTFA) \u2014\u00a0a measure of when the model starts \u2018speaking\u2019 after receiving input \u2014 of 90ms for a 10-second sample of 500 characters. 
The model also has a real-time factor (RTF) of 6x, which means it can render a 10-second clip in roughly 1.6 seconds.<\/p>\n<figure class=\"wp-block-image aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" height=\"446\" width=\"680\" src=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg?w=680\" alt=\"\" class=\"wp-image-3105894\" srcset=\"https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg 1600w, https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg?resize=150,98 150w, https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg?resize=300,197 300w, https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg?resize=768,504 768w, https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg?resize=680,446 680w, https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg?resize=1200,788 1200w, https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg?resize=1280,840 1280w, https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg?resize=430,282 430w, https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg?resize=720,473 720w, https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg?resize=900,591 900w, https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg?resize=800,525 800w, https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg?resize=1536,1008 1536w, https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg?resize=668,438 668w, 
https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg?resize=571,375 571w, https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg?resize=940,617 940w, https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg?resize=708,465 708w, https:\/\/techcrunch.com\/wp-content\/uploads\/2026\/03\/1cc0ce97-f6f1-477a-8c25-23ca793d0571.jpeg?resize=50,33 50w\" sizes=\"auto, (max-width: 680px) 100vw, 680px\"\/><figcaption class=\"wp-element-caption\"><span class=\"wp-element-caption__text\">Image Credits: Mistral AI<\/span><\/figcaption><\/figure>\n<p class=\"wp-block-paragraph\">Earlier this year, Mistral launched <a rel=\"nofollow noopener\" href=\"https:\/\/mistral.ai\/news\/voxtral-transcribe-2\" target=\"_blank\">a pair of transcription models<\/a>, one for large batch processing and the other for real-time use cases with low latency. With the new speech model, the company is likely aiming to provide a full suite of voice products to enterprises.<\/p>\n<p class=\"wp-block-paragraph\">\u201cWe plan to have an end-to-end platform that can handle multimodal streams of input, including audio, text, and image and output as well. 
The main benefit of that is you get way more information with an end-to-end agentic system that supports audio as an input or output,\u201d Stock said.<\/p>\n<p class=\"wp-block-paragraph\">Mistral is betting that its open-source release and customization options will lead enterprises to choose its voice models over competitors\u2019 offerings, since they can tune the model to their needs.<\/p>\n<\/div>\n<p><br \/>\n<br \/><a href=\"https:\/\/techcrunch.com\/2026\/03\/26\/mistral-releases-a-new-open-source-model-for-speech-generation\/\" target=\"_blank\" rel=\"noopener\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>French AI company Mistral released a new open-source text-to-speech model on Thursday that can be used by voice AI assistants or in enterprise use cases like customer support. The model, which lets enterprises build voice agents for sales and customer engagement, puts Mistral in direct competition with the likes of ElevenLabs, Deepgram, and OpenAI. 
The [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":230803,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[14],"tags":[],"class_list":{"0":"post-230801","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-tech"},"_links":{"self":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/230801","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/comments?post=230801"}],"version-history":[{"count":0,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/230801\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media\/230803"}],"wp:attachment":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media?parent=230801"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/categories?post=230801"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/tags?post=230801"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}