{"id":72156,"date":"2024-01-30T13:30:15","date_gmt":"2024-01-30T13:30:15","guid":{"rendered":"https:\/\/entertainment.runfyers.com\/index.php\/2024\/01\/30\/inference-ai-matches-ai-workloads-with-cloud-gpu-compute-techcrunch\/"},"modified":"2024-01-30T13:30:15","modified_gmt":"2024-01-30T13:30:15","slug":"inference-ai-matches-ai-workloads-with-cloud-gpu-compute-techcrunch","status":"publish","type":"post","link":"https:\/\/entertainment.runfyers.com\/index.php\/2024\/01\/30\/inference-ai-matches-ai-workloads-with-cloud-gpu-compute-techcrunch\/","title":{"rendered":"Inference.ai matches AI workloads with cloud GPU compute | TechCrunch"},"content":{"rendered":"<p> <br \/>\n<\/p>\n<div id=\"\">\n<div class=\"article__featured-image-wrapper breakout\">\n\t\t\t\n\t\t<\/div>\n<\/p><\/div>\n<div>\n<p id=\"speakable-summary\">GPUs\u2019 ability to perform many computations in parallel make them well-suited to running today\u2019s most capable AI. But GPUs are becoming tougher to procure, as companies of all sizes increase their investments in AI-powered products.<\/p>\n<p id=\"speakable-summary\">Nvidia\u2019s best-performing AI cards <a href=\"https:\/\/www.barrons.com\/articles\/nvidia-ai-chips-coreweave-cloud-6db44825#:~:text=The%20rising%20excitement%20over%20generative,technology%20industry&#039;s%20most%20precious%20resource.\" target=\"_blank\" rel=\"noopener\">sold out<\/a> last year, and the CEO of chipmaker TSMC <a href=\"https:\/\/wccftech.com\/nvidia-ai-gpu-shortage-could-last-till-2025-due-to-supply-constraints-says-tsmc\/\" target=\"_blank\" rel=\"noopener\" data-mrf-link=\"https:\/\/wccftech.com\/nvidia-ai-gpu-shortage-could-last-till-2025-due-to-supply-constraints-says-tsmc\/\">suggested<\/a> that general supply could be constrained into 2025. The problem\u2019s so acute, in fact, that it has the U.S. Federal Trade Commission\u2019s attention \u2014 the agency recently <a href=\"https:\/\/www.jdsupra.com\/legalnews\/ftc-opens-inquiry-into-generative-ai-8613662\/\" target=\"_blank\" rel=\"noopener\">announced<\/a> it\u2019s investigating several partnerships between AI startups and cloud giants like Google and AWS over whether the startups might have anti-competitive, privileged access to GPU compute.<\/p>\n<p>What\u2019s the solution? It depends on your resources, really. Tech giants like Meta, Google, Amazon and Microsoft are <a href=\"https:\/\/techcrunch.com\/2023\/05\/18\/meta-bets-big-on-ai-with-custom-chips-and-a-supercomputer\/\" target=\"_blank\" rel=\"noopener\">buying up what GPUs they can<\/a> and <a href=\"https:\/\/techcrunch.com\/2021\/05\/18\/google-launches-the-next-generation-of-its-custom-ai-chips\/\" target=\"_blank\" rel=\"noopener\">developing<\/a> their <a href=\"https:\/\/techcrunch.com\/2023\/11\/28\/amazon-unveils-new-chips-for-training-and-running-ai-models\/\" target=\"_blank\" rel=\"noopener\">own<\/a> <a href=\"https:\/\/techcrunch.com\/2023\/11\/15\/microsoft-looks-to-free-itself-from-gpu-shackles-by-designing-custom-ai-chips\/#:~:text=Today%20at%20its%202023%20Ignite,to%20run%20general%20purpose%20workloads.\" target=\"_blank\" rel=\"noopener\">custom chips<\/a>. Ventures with fewer resources are at the mercy of the market \u2014 but it doesn\u2019t have to be that way forever, say John Yue and Michael Yu.<\/p>\n<p>Yue and Yu are the co-founders of <a href=\"http:\/\/inference.ai\" target=\"_blank\" rel=\"noopener\">Inference.ai<\/a>, a platform that provides infrastructure-as-a-service cloud GPU compute through partnerships with third-party data centers. Inference uses algorithms to match companies\u2019 workloads with GPU resources, Yue says \u2014 aiming to take the guesswork out of choosing and acquiring infrastructure.<\/p>\n<p>\u201cInference brings clarity to the confusing hardware landscape for founders and developers with new chips coming from Nvidia, Intel, AMD, Groq [and so on] \u2014 allowing higher throughput, lower latency and lower cost,\u201d Yue said. \u201cOur tools and team allow for decision-makers to filter out a lot of the noise and quickly find the right fit for their project.\u201d<\/p>\n<p>Inference essentially provides customers a GPU instance in the cloud, along with 5TB of object storage. The company claims that \u2014 thanks to its algorithmic matching tech and deals with data center operators \u2014 it can offer dramatically cheaper GPU compute with better availability than major public cloud providers.<\/p>\n<p>\u201cThe hosted GPU market is confusing and changes daily,\u201d Yue said. \u201cPlus, we\u2019ve seen pricing vary up to 1000% for the same configuration. Our tools and team allow for decision makers to filter out a lot of the noise and quickly find the right fit for their project.\u201d<\/p>\n<p>Now, TechCrunch wasn\u2019t able to put those claims to the test. But regardless of whether they\u2019re true, Inference has competition \u2014 and lots of it.<\/p>\n<p>See: CoreWeave, a crypto mining operation-turned-GPU provider, which is <a href=\"https:\/\/www.bloomberg.com\/news\/articles\/2023-08-30\/coreweave-said-to-seek-stake-sale-at-up-to-8-billion-valuation\" target=\"_blank\" rel=\"noopener\">reportedly<\/a> expected to rake in around $1.5 billion in revenue by 2024. Its close competitor, Lambda Labs, <a href=\"https:\/\/www.datacenterdynamics.com\/en\/news\/lambda-labs-close-to-raising-300-million-for-ai-cloud\/\" target=\"_blank\" rel=\"noopener\">secured<\/a> $300 million in venture capital last October. There\u2019s also <a href=\"https:\/\/techcrunch.com\/2023\/11\/29\/together-lands-102-5m-investment-to-grow-its-cloud-for-training-generative-ai\/\" target=\"_blank\" rel=\"noopener\">Together<\/a> \u2014 a GPU cloud \u2014 not to mention startups like <a href=\"https:\/\/techcrunch.com\/2022\/07\/21\/run-ai-partners-with-nvidia-as-it-sets-its-sights-on-inferencing\/\" target=\"_blank\" rel=\"noopener\">Run.ai<\/a> and <a href=\"https:\/\/techcrunch.com\/2022\/04\/28\/exafunction-aims-to-reduce-ai-dev-costs-by-abstracting-away-hardware\/\" target=\"_blank\" rel=\"noopener\">Exafunction<\/a>, which aim to reduce AI dev costs by abstracting away the underlying hardware.<\/p>\n<p>Inference\u2019s investors seem to think there\u2019s room for another player, though. The startup recently closed a $4 million round from Cherubic Ventures, Maple VC and Fusion Fund, which Yue says is being put toward build out Inference\u2019s deployment infrastructure.<\/p>\n<p>In an emailed statement, Cherubic\u2019s Matt Cheng added:<\/p>\n<p>\u201cThe requirements for processing capacity will keep on increasing as AI is the foundation of so many of today\u2019s products and systems. We\u2019re confident that the Inference team, with their past knowledge in hardware and cloud infrastructure, has what it takes to succeed. We decided to invest because accelerated computing and storage services are driving the AI revolution, and Inference product will fuel the next wave of AI growth.\u201d<\/p>\n<\/p><\/div>\n<p><br \/>\n<br \/><a href=\"https:\/\/techcrunch.com\/2024\/01\/30\/inference-ai-matches-ai-workloads-with-cloud-gpu-compute\/\" target=\"_blank\" rel=\"noopener\">Source link <\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>GPUs\u2019 ability to perform many computations in parallel make them well-suited to running today\u2019s most capable AI. But GPUs are becoming tougher to procure, as companies of all sizes increase their investments in AI-powered products. Nvidia\u2019s best-performing AI cards sold out last year, and the CEO of chipmaker TSMC suggested that general supply could be [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":72157,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[14],"tags":[],"class_list":{"0":"post-72156","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-tech"},"_links":{"self":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/72156","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/comments?post=72156"}],"version-history":[{"count":0,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/posts\/72156\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media\/72157"}],"wp:attachment":[{"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/media?parent=72156"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/categories?post=72156"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/entertainment.runfyers.com\/index.php\/wp-json\/wp\/v2\/tags?post=72156"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}