Startups

Deepgram lands new cash to grow its enterprise voice-recognition business

Comment

Image Credits: metamorworks / Getty Images

Deepgram, a company developing voice-recognition tech for the enterprise, today raised $47 million in new funding led by Madrona Venture Group with participation from Citi Ventures and Alkeon. An extension of Deepgram’s Series B that kicked off in February 2021, led by Tiger Global, it brings the startup’s total raised to $86 million, which CEO Scott Stephenson says is being put toward R&D in areas like emotion detection, intent recognition, summarization, topic detection, translation and redaction.

“We’re pleased that Deepgram achieved its highest-ever pre- and post-money valuation, even despite the challenging market conditions,” Stephenson told TechCrunch in an email interview. (Unfortunately, he wouldn’t reveal what exactly the valuation was.) “We believe that Deepgram is in a strong position to thrive in this tougher macroeconomic environment. Deepgram’s speech AI is the core enabling technology behind many of our customers’ applications, and the demand for speech understanding grows as companies seek greater efficiency.”

Launched in 2015, Deepgram focuses on building custom voice-recognition solutions for customers such as Spotify, Auth0 and even NASA. The company’s data scientists source, create, label and evaluate speech data to produce speech-recognition models that can understand brands and jargon, capture an array of languages and accents, and adapt to challenging audio environments. For example, for NASA, Deepgram built a model to transcribe communications between Mission Control and the International Space Station.

“Audio data is one of the world’s largest untapped data sources. [But] it’s difficult to use in its audio format because audio is an unstructured data type, and, therefore, can’t be mined for insights without further processing,” Stephenson said. “Deepgram takes unstructured audio data and structures it as text and metadata at high speeds and low costs designed for enterprise scale … [W]ith Deepgram, [companies] can send all their customer audio — hundreds of thousands or millions of hours — to be transcribed and analyzed.”

Where does the audio data to train Deepgram’s models come from? Stephenson was a bit coy there, although he didn’t deny that Deepgram uses customer data to improve its systems. He was quick to point out that the company complies with GDPR and lets users request that their data be deleted at any time.

“Deepgram’s models are primarily trained on data collected or generated by our data curation experts, alongside some anonymized data submitted by our users,” Stephenson said. “Training models on real-world data is a cornerstone of our product’s quality; it’s what allows machine learning systems like ours to produce human-like results. That said, we allow our users to opt out of having their anonymized data used for training if they so choose.”

Through Deepgram’s API, companies can build the platform into their tech stacks to enable voice-based automations and customer experiences. For organizations in heavily regulated sectors, like healthcare and government, Deepgram offers an on-premises deployment option that allows customers to manage and process data locally. (Worth noting, In-Q-Tel, the CIA’s strategic investment arm, has backed Deepgram in the past.)

Deepgram — a Y Combinator graduate founded by Stephenson and Noah Shutty, a University of Michigan physics graduate — competes with a number of vendors in a speech-recognition market that could be worth $48.8 billion by 2030, according to one (optimistic?) source. Tech giants like Nuance, Cisco, Google, Microsoft and Amazon offer real-time voice transcription and captioning services, as do startups like Otter, Speechmatics, Voicera and Verbit.

The tech has hurdles to overcome. According to a 2022 report by Speechmatics, 29% of execs have observed AI bias in voice technologies — specifically imbalances in the types of voices that are understood by speech recognition. But the demand is evidently strong enough to prop up the range of vendors out there; Stephenson claims that Deepgram’s gross margins are “in line with top-performing software businesses.”

That’s in contrast to the consumer voice-recognition market, which has taken a turn for the worse as of late. Amazon’s Alexa division is reportedly on pace to lose $10 billion this year. And Google is rumored to be eyeing cuts to Google Assistant development in favor of more profitable projects.

In recent months, Stephenson says that Deepgram’s focus has been on on-the-fly language translation, sentiment analysis and split transcripts of multiway conversations. The company’s also scaling, now reaching over 300 customers and more than 15,000 users.

On the hunt for new business, Deepgram recently launched the Deepgram Startup Program, which offers $10 million in free speech-recognition credits on Deepgram’s platform to startups in education and corporate. Firms participating don’t need to pay any sort of fee and can use the funds in conjunction with existing grant, seed, incubator and accelerator benefits.

“Deepgram’s business continues to grow rapidly. As a foundational AI infrastructure company, we haven’t seen a reduction in demand for Deepgram,” Stephenson said. “In fact, we’ve watched businesses look for ways to cut costs and delegate repetitive, menial tasks to AIs — giving humans more time to pursue interesting, consequential work. Examples of this include reducing large cloud compute costs by switching big cloud transcription to Deepgram’s transcription product, or in new use cases like drive-thru ordering and triaging the first round of customer service responses.”

Deepgram currently has 146 employees distributed across offices in Ann Arbor and San Francisco. When asked about hiring plans for the rest of the year, Stephenson declined to answer — no doubt cognizant of the unpredictability of the current global economy and the optics of committing to a firm number.

More TechCrunch

Mobile app developers, including Patreon and Grammarly, are already integrating with Gemini Nano, its smallest AI model, the company announced during its I/O developer keynote on Tuesday. The companies, along…

Patreon and Grammarly are already experimenting with Gemini Nano, says Google

As part of the update, Reddit also launched a dedicated AMA tab within the web post composer.

Reddit introduces new tools for ‘Ask Me Anything,’ its Q&A feature

Here are quick hits of the biggest news from the keynote as they are announced.

Google I/O 2024: Here’s everything Google just announced

LearnLM is already powering features across Google products, including in YouTube, Google’s Gemini apps, Google Search and Google Classroom.

LearnLM is Google’s new family of AI models for education

The official launch comes almost a year after YouTube began experimenting with AI-generated quizzes on its mobile app. 

Google is bringing AI-generated quizzes to academic videos on YouTube

Around 550 employees across autonomous vehicle company Motional have been laid off, according to information taken from WARN notice filings and sources at the company.  Earlier this week, TechCrunch reported…

Motional cut about 550 employees, around 40%, in recent restructuring, sources say

The keynote kicks off at 10 a.m. PT on Tuesday and will offer glimpses into the latest versions of Android, Wear OS and Android TV.

Google I/O 2024: Watch all of the AI, Android reveals

It ran 110 minutes, but Google managed to reference AI a whopping 121 times during Google I/O 2024 (by its own count). CEO Sundar Pichai referenced the figure to wrap…

Google mentioned ‘AI’ 120+ times during its I/O keynote

Google Play has a new discovery feature for apps, new ways to acquire users, updates to Play Points, and other enhancements to developer-facing tools.

Google Play preps a new full-screen app discovery feature and adds more developer tools

Soon, Android users will be able to drag and drop AI-generated images directly into their Gmail, Google Messages and other apps.

Gemini on Android becomes more capable and works with Gmail, Messages, YouTube and more

Veo can capture different visual and cinematic styles, including shots of landscapes and timelapses, and make edits and adjustments to already-generated footage.

Google Veo, a serious swing at AI-generated video, debuts at Google I/O 2024

In addition to the body of the emails themselves, the feature will also be able to analyze attachments, like PDFs.

Gemini comes to Gmail to summarize, draft emails, and more

The summaries are created based on Gemini’s analysis of insights from Google Maps’ community of more than 300 million contributors.

Google is bringing Gemini capabilities to Google Maps Platform

Google says that over 100,000 developers already tried the service.

Project IDX, Google’s next-gen IDE, is now in open beta

The system effectively listens for “conversation patterns commonly associated with scams” in-real time. 

Google will use Gemini to detect scams during calls

The standard Gemma models were only available in 2 billion and 7 billion parameter versions, making this quite a step up.

Google announces Gemma 2, a 27B-parameter version of its open model, launching in June

This is a great example of a company using generative AI to open its software to more users.

Google TalkBack will use Gemini to describe images for blind people

Firebase Genkit is an open source framework that enables developers to quickly build AI into new and existing applications.

Google launches Firebase Genkit, a new open source framework for building AI-powered apps

This will enable developers to use the on-device model to power their own AI features.

Google is building its Gemini Nano AI model into Chrome on the desktop

Google’s Circle to Search feature will now be able to solve more complex problems across psychics and math word problems. 

Circle to Search is now a better homework helper

People can now search using a video they upload combined with a text query to get an AI overview of the answers they need.

Google experiments with using video to search, thanks to Gemini AI

A search results page based on generative AI as its ranking mechanism will have wide-reaching consequences for online publishers.

Google will soon start using GenAI to organize some search results pages

Google has built a custom Gemini model for search to combine real-time information, Google’s ranking, long context and multimodal features.

Google is adding more AI to its search results

At its Google I/O developer conference, Google on Tuesday announced the next generation of its Tensor Processing Units (TPU) AI chips.

Google’s next-gen TPUs promise a 4.7x performance boost

Google is upgrading Gemini, its AI-powered chatbot, with features aimed at making the experience more ambient and contextually useful.

Google’s Gemini updates: How Project Astra is powering some of I/O’s big reveals

Veo can generate few-seconds-long 1080p video clips given a text prompt.

Google’s image-generating AI gets an upgrade

At Google I/O, Google announced upgrades to Gemini 1.5 Pro, including a bigger context window. .

Google’s generative AI can now analyze hours of video

The AI upgrade will make finding the right content more intuitive and less of a manual search process.

Google Photos introduces an AI search feature, Ask Photos

Apple released new data about anti-fraud measures related to its operation of the iOS App Store on Tuesday morning, trumpeting a claim that it stopped over $7 billion in “potentially…

Apple touts stopping $1.8B in App Store fraud last year in latest pitch to developers

Online travel agency Expedia is testing an AI assistant that bolsters features like search, itinerary building, trip planning, and real-time travel updates.

Expedia starts testing AI-powered features for search and travel planning