{"id":64917,"date":"2023-12-28T19:31:20","date_gmt":"2023-12-28T19:31:20","guid":{"rendered":"https:\/\/entertainment.runfyers.com\/index.php\/2023\/12\/28\/giga-ml-wants-to-help-companies-deploy-llms-offline-techcrunch\/"},"modified":"2023-12-28T19:31:20","modified_gmt":"2023-12-28T19:31:20","slug":"giga-ml-wants-to-help-companies-deploy-llms-offline-techcrunch","status":"publish","type":"post","link":"https:\/\/entertainment.runfyers.com\/index.php\/2023\/12\/28\/giga-ml-wants-to-help-companies-deploy-llms-offline-techcrunch\/","title":{"rendered":"Giga ML wants to help companies deploy LLMs offline | TechCrunch"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div>\n<p id=\"speakable-summary\">AI is all the rage \u2014 particularly text-generating AI, also known as large language models (think models along the lines of <a href=\"https:\/\/techcrunch.com\/tag\/chatgpt\/\" target=\"_blank\" rel=\"noopener\">ChatGPT<\/a>). In one recent <a href=\"https:\/\/ai-infrastructure.org\/enterprise-generative-ai-adoption-report-aug-2023\/\" target=\"_blank\" rel=\"noopener\">survey<\/a> of ~1,000 enterprise organizations, 67.2% say that they see adopting large language models (LLMs) as a top priority by early 2024.<\/p>\n<p>But barriers stand in the way. According to the same survey, a lack of customization and flexibility, paired with the inability to preserve company knowledge and IP, were \u2014 and are \u2014 preventing many businesses from deploying LLMs into production.<\/p>\n<p>That got Varun Vummadi and Esha Manideep Dinne thinking: what might a solution to the enterprise LLM adoption challenge look like? In search of one, they founded <a href=\"https:\/\/gigaml.com\/\" target=\"_blank\" rel=\"noopener\">Giga ML<\/a>, a startup building a platform that lets companies deploy LLMs on-premise \u2014 ostensibly cutting costs and preserving privacy in the process.<\/p>\n<p>\u201cData privacy and customizing LLMs are some of the biggest challenges faced by enterprises when adopting LLMs to solve problems,\u201d Vummadi told TechCrunch in an email interview. \u201cGiga ML addresses both of these challenges.\u201d<\/p>\n<p>Giga ML offers its own set of LLMs, the \u201cX1 series,\u201d for tasks like generating code and answering common customer questions (e.g. \u201cWhen can I expect my order to arrive?\u201d). The startup claims the models, built atop Meta\u2019s <a href=\"https:\/\/techcrunch.com\/2023\/07\/18\/meta-releases-llama-2-a-more-helpful-set-of-text-generating-models\/\" target=\"_blank\" rel=\"noopener\">Llama 2<\/a>, outperform popular LLMs on certain benchmarks, particularly the <a href=\"https:\/\/klu.ai\/glossary\/mt-bench-eval\" target=\"_blank\" rel=\"noopener\">MT-Bench<\/a> test set for dialogs. But it\u2019s tough to say how X1 compares qualitatively; this reporter tried Giga ML\u2019s <a href=\"https:\/\/www.chat.gigaml.com\/\" target=\"_blank\" rel=\"noopener\">online demo<\/a> but ran into technical issues. (The app timed out no matter what prompt I typed.)<\/p>\n<p>Even if Giga ML\u2019s models\u00a0<em>are\u00a0<\/em>superior in some aspects, though, can they really make a splash in the <a href=\"https:\/\/techcrunch.com\/2023\/11\/05\/valued-at-1b-kai-fu-lees-llm-startup-unveils-open-source-model\/\" target=\"_blank\" rel=\"noopener\">ocean<\/a> of <a href=\"https:\/\/techcrunch.com\/2023\/08\/24\/meta-releases-code-llama-a-code-generating-ai-model\/\" target=\"_blank\" rel=\"noopener\">open source<\/a>, <a href=\"https:\/\/techcrunch.com\/2023\/07\/12\/lince-llm\/\" target=\"_blank\" rel=\"noopener\">offline<\/a> <a href=\"https:\/\/techcrunch.com\/2023\/09\/27\/mistral-ai-makes-its-first-large-language-model-free-for-everyone\/\" target=\"_blank\" rel=\"noopener\">LLMs<\/a>?<\/p>\n<p>In talking to Vummadi, I got the sense that Giga ML isn\u2019t so much trying to create the best-performing LLMs out there but instead building tools to allow businesses to fine-tune LLMs locally without having to rely on third-party resources and platforms.<\/p>\n<p><span style=\"font-weight: 400;\">\u201cGiga ML\u2019s mission is to help enterprises safely and efficiently deploy LLMs on their own on-premises infrastructure or virtual private cloud,\u201d Vummadi said. \u201cGiga ML simplifies the process of training, fine-tuning and running LLMs by taking care of it through an easy-to-use API, eliminating any associated hassle.\u201d<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Vummadi emphasized the privacy advantages of running models offline \u2014 advantages likely to be persuasive for some businesses.<\/span><\/p>\n<p><span style=\"font-weight: 400;\">Predibase, the low-code AI dev platform, found that less than a quarter of enterprises are comfortable using commercial LLMs because of concerns over sharing sensitive or proprietary data with vendors. Nearly 77% of respondents to the survey said that they either don\u2019t use or don\u2019t plan to use commercial LLMs beyond prototypes in production \u2014\u00a0 citing issues relating to privacy, cost, and lack of customization.<\/span><\/p>\n<p>\u201cIT managers at the C-suite level find Giga ML\u2019s offerings valuable because of the secure on-premise deployment of LLMs, customizable models tailored to their specific use case and fast inference, which ensures data compliance and maximum efficiency,\u201d <span style=\"font-weight: 400;\">Vummadi said.\u00a0<\/span><\/p>\n<p>Giga ML, which has raised ~$3.74 million in VC funding to date from Nexus Venture Partners, Y Combinator, Liquid 2 Ventures, 8vdx and several others, plans in the near term to grow its two-person team and ramp up product R&amp;D. A portion of the capital is going toward supporting Giga ML\u2019s customer base, as well, <span style=\"font-weight: 400;\">Vummadi said, which currently includes unnamed \u201centerprise\u201d companies in finance and healthcare.<\/span><\/p>\n<\/p><\/div>\n<p><br \/>\n<br \/><a href=\"https:\/\/techcrunch.com\/2023\/12\/28\/giga-ml-wants-to-help-companies-deploy-llms-offline\/\" target=\"_blank\" rel=\"noopener\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>AI is all the rage \u2014 particularly text-generating AI, also known as large language models (think models along the lines of ChatGPT). In one recent survey of ~1,000 enterprise organizations, 67.2% say that they see adopting large language models (LLMs) as a top priority by early 2024. But barriers stand in the way. According to [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":64918,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[14],"tags":[],"class_list":{"0":"post-64917","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-tech"},"_links":{"self":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/64917","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/comments?post=64917"}],"version-history":[{"count":0,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/64917\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media\/64918"}],"wp:attachment":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media?parent=64917"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/categories?post=64917"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/tags?post=64917"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}