Deci lands $25M for tech that makes AI models more efficient

Deci, a 50-person startup developing a platform to build and optimize AI-powered systems, today announced that it closed a $25 million Series B financing round led by Insight Partners, with participation from Square Peg, Emerge, Jibe Ventures, Fort Ross Ventures and ICON. The round brings the company’s total raised to $55.1 million. The funds will be used to expand Deci’s go-to-market activities as well as support the company’s R&D efforts, according to co-founder and CEO Yonatan Geifman.

Companies face several hurdles in creating text-, audio- and image-analyzing AI models for deployment across their apps and services. Cost is an outsize one — training a single model on commercial hardware can cost tens of thousands of dollars, if not more. While newer generations of chips and custom-designed AI accelerators have helped to reduce the burden somewhat, creating a model from scratch is still no easy feat.

Geifman proposes neural architecture search (NAS) as a solution. NAS, a family of techniques on which Deci heavily relies, can help automatically discover low-cost, optimal models for a given problem. Deci isn’t unique in this — Google’s Vertex AI service leverages NAS to optimize the performance of models on specific, customer-specified tasks. But Geifman argues that Deci’s platform offers access to NAS capabilities at a lower cost.
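For readers unfamiliar with NAS, the basic idea can be illustrated with a toy random search: sample candidate architectures, discard those that exceed a compute budget, and keep the one with the best estimated accuracy. The sketch below is not Deci’s or Google’s method. The search space, the `Arch` type and both scoring functions are hypothetical stand-ins chosen for illustration; a real system would train and benchmark each candidate on the target hardware.

```python
import random
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Arch:
    depth: int  # number of layers
    width: int  # units per layer


# Hypothetical search space; real NAS systems explore far richer ones.
DEPTHS = [2, 4, 8, 16]
WIDTHS = [32, 64, 128, 256]


def estimated_cost(arch: Arch) -> int:
    """Rough compute proxy (assumption): depth times width squared."""
    return arch.depth * arch.width * arch.width


def estimated_accuracy(arch: Arch) -> float:
    """Made-up score that grows with capacity and saturates below 1.0.

    Stands in for actually training and validating the candidate.
    """
    capacity = arch.depth * arch.width
    return capacity / (capacity + 256)


def random_search(budget: int, trials: int, seed: int = 0) -> Optional[Arch]:
    """Return the best sampled architecture whose cost fits the budget."""
    rng = random.Random(seed)
    best: Optional[Arch] = None
    for _ in range(trials):
        cand = Arch(rng.choice(DEPTHS), rng.choice(WIDTHS))
        if estimated_cost(cand) > budget:
            continue  # too expensive for the target hardware
        if best is None or estimated_accuracy(cand) > estimated_accuracy(best):
            best = cand
    return best


if __name__ == "__main__":
    print(random_search(budget=100_000, trials=200))
```

Production NAS replaces the random sampling with smarter search strategies and the made-up scores with measured accuracy and latency, which is where most of the expense comes from.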

In 2019, Geifman co-founded Deci alongside Ran El-Yaniv and entrepreneur Jonathan Elial. Geifman and El-Yaniv met at Technion’s computer science department, where Geifman was a Ph.D. candidate and El-Yaniv a professor.

“Deci’s proprietary technology [can generate] new image classification models that … deliver more than 2x improvement in runtime, coupled with improved accuracy, as compared to the most powerful models publicly available,” Geifman told TechCrunch in an email. “This means that AI applications that previously could only be deployed on large and expensive GPUs can now be deployed on CPUs.”

Those are lofty claims. But Deci has the backing of Intel, which last March announced a strategic business and technology collaboration with the startup to optimize machine learning on Intel processors. The partnership led to the creation of a model that speeds up question-answering tasks on Intel CPUs and an image classification model for Cascade Lake processors that “significantly reduces compute overhead,” Geifman claims.

Geifman previously told TechCrunch that one of Deci’s customers, a videoconferencing provider, used the platform to roll out a feature that blurs backgrounds on users’ devices. Others have tapped Deci to build better models for their own internal computing, even when they theoretically have the GPUs and compute power on hand to run anything.

“Deci was created to empower developers and eliminate production-related bottlenecks across the AI lifecycle,” Geifman said. “The business impact of this ability translates into … shortening time to production and the ability to unlock new AI use cases and address new market segments on resource-constrained devices.”

Geifman also notes that compressed models can help companies save on inference compute costs — that is, the costs of actually serving models once they’ve been deployed. Owing in part to the popularity of hosting models in the cloud, over a third of businesses regularly have cloud budget overruns of up to 40%, according to a poll by observability software vendor Pepperdata.

While Geifman asserts that Deci’s business continues to grow, the startup faces challenges, including the technical limitations of NAS. (NAS, which is difficult to evaluate, can be expensive and time-consuming.) Deci also competes with a number of companies developing ways to make models more efficient, like OctoML, Neural Magic and OmniML.

The coming months will be a test of Deci’s robustness to headwinds.

“While we cannot disclose the valuation, we can say it increased significantly when compared to the previous round. Due to the growth in Deci’s business and the product expansion opportunities into additional domains such as natural language processing, among others, our existing investors decided to double down to support that growth,” Geifman said. “We haven’t seen a major impact [from recent economic developments]. Our focus has largely been on enterprise, while the slowdown has mainly affected mid-market companies and startups.”

Insight Partners managing director Lonne Jaffe, a board member at Deci, added in an email to TechCrunch: “Deci’s powerful technology lets you input your AI models, data and target hardware, whether that hardware is on the edge or in the cloud, and guides you in finding alternative models that will generate similar predictive accuracy with massively improved efficiency … [It’s a value add because] having a more efficient infrastructure for AI systems can make AI products qualitatively different and better, not only cheaper and faster to run.”
