Startups

NeuReality lands $35M to bring AI accelerator chips to market

Comment

computer circuit board
Image Credits: CHRISTOPH BURGSTEDT/SCIENCE PHOTO LIBRARY / Getty Images

The growing demand for AI, particularly generative AI (i.e., AI that generates images, text and more), is supercharging the AI inferencing chip market. Inferencing chips accelerate the AI inferencing process, which is where AI systems generate outputs (e.g., text, images, audio) based on what they learned while “training” on a specific set of data. AI inferencing chips can be — and have been — used to yield faster generations from systems such as Stable Diffusion, which translates text prompts into artwork, and OpenAI’s GPT-3, which extends a few lines of prose into full-length poems, essays and more.

A number of vendors — both startups and well-established players — are actively developing and selling access to AI inferencing chips. There’s Hailo, Mythic and Flex Logix, to name a few upstarts. And on the incumbent side, Google’s competing for dominance with its tensor processing units (TPUs) while Amazon’s betting on Inferentia. But the competition, while fierce, hasn’t scared away firms like NeuReality, which occupy the AI chip inferencing market but aim to differentiate themselves by offering a suite of software and services to support their hardware.

On the subject, NeuReality today announced that it raised $35 million in a Series A funding round led by Samsung Ventures, Cardumen Capital, Varana Capital, OurCrowd and XT Hi-Tech with participation from SK Hynix, Cleveland Avenue, Korean Investment Partners, StoneBridge, and Glory Ventures. Co-founder and CEO Moshe Tanach tells TechCrunch that the tranche will be put toward finalizing the design of NeuReality’s flagship AI inferencing chip in early 2023 and shipping it to customers.

NeuReality was founded with the vision to build a new generation of AI inferencing solutions that are unleashed from traditional CPU-centric architectures and deliver high performance and low latency, with the best possible efficiency in cost and power consumption,” Tanach told TechCrunch via email. “Most companies that can leverage AI don’t have the funds nor the huge R&D that Amazon, Meta and other huge companies investing in AI have. NeuReality will bring AI tech to anyone who wants to deploy easily and affordably.”

NeuReality was co-founded in 2019 by Tzvika Shmueli, Yossi Kasus and Tanach, who previously served as a director of engineering at Marvell and Intel. Shmueli was formerly the VP of back-end infrastructure at Mellanox Technologies and the VP of engineering at Habana Labs. As for Kasus, he held a senior director of engineering role at Mellanox and was the head of integrations at semiconductor company EZchip.

From the start, NeuReality focused on bringing to market AI hardware for cloud data centers and “edge” computers, or machines that run on-premises and do most of their data processing offline. Tanach says that the startup’s current-generation product lineup, the Network Attached Processing Unit (NAPU), is optimized for AI inference applications, including computer vision (think algorithms that recognize objects in photos), natural language processing (text-generating and classifying systems) and recommendation engines (like the type that suggest products on e-commerce sites).

NeuReality’s NAPU is essentially a hybrid of multiple types of processors. It can perform functions like AI inferencing load balancing, job scheduling and queue management, which have traditionally been done in software but not necessarily very efficiently.

NeuReality
Image Credits: NeuReality

NeuReality’s NR1, an FPGA-based SKU within the NAPU family, is a network-attached “server on a chip” with an embedded AI inferencing accelerator along with networking and virtualization capabilities. NeuReality also offers the NR1-M module, a PCIe card containing an NR1 and a network-attached inference server, and a separate module — the NR1-S — that pairs several NR1-Ms with the NR1.

On the software side, NeuReality delivers a set of tools, including a software development kit for cloud and local workloads, a deployment manager to help with runtime issues and a monitoring dashboard.

“The software for AI inference [and] the tools for heterogeneous compute and automated flow of compilation and deployment … is the magic that supports our innovative hardware approach,” Tanach said. “The first beneficiaries of the NAPU technology are enterprises and cloud solution providers that need infrastructure to support their chatbots, voice bots, automatic transcriptions and sentiment analysis as well as computer vision use cases for document scans, defect detection, etc.  … While the world was focusing on the deep learning processor improvements, NeuReality focused on optimizing the system around it and the software layers above it to provide higher efficiency and a much easier flow to deploy inference.”

NeuReality, it must be noted, has yet to back up some of its performance claims with empirical evidence. It told ZDNet in a recent article that it estimates its hardware will deliver a 15x improvement in performance per dollar compared to the available GPUs and ASICs offered by deep learning accelerator vendors, but NeuReality hasn’t released validating benchmarking data. The startup also hasn’t detailed its proprietary networking protocol, a protocol that it has previously claimed is more performant than existing solutions.

Those items aside, delivering hardware at massive scale isn’t easy — particularly where it involves custom AI inferencing chips. But Tanach argues that NeuReality has laid the necessary groundwork, partnering with AMD-owned semiconductor manufacturer Xilinx for production and inking a partnership with IBM to work on hardware requirements for the NR1. (IBM, which is also a NeuReality design partner, previously said it’s “evaluating” the startup’s products for use in the IBM cloud.) NeuReality has been shipping prototypes to partners since May 2021, Tanach says.

According to Tanach, beyond IBM, NeuReality is working with Lenovo, AMD and unnamed cloud solution providers, system integrators, deep learning accelerator vendors and “inference-consuming” enterprises on deployments. Tanach declined, however, to reveal how many customers the startup currently has or what roughly it’s projecting in terms of revenue.

“We see that the pandemic is slowing companies down and pushing for consolidation between the many deep learning vendors. However, for us it doesn’t change anything, since late next year or sometime through 2024 inference deployment is expected to explode — and our technology is exactly the enabler and driver of that growth,” Tanach said. “The NAPU will bring AI for a broader set of less technical companies. It is also set to allow large-scale users such as ‘hyperscalers’ and next-wave data center customers to support their growing scale of AI usage.”

Ori Kirshner, the head of Samsung Ventures in Israel, added in an emailed statement: “We see substantial and immediate need for higher efficiency and easy-to-deploy inference solutions for data centers and on-premises use cases, and this is why we are investing in NeuReality. The company’s innovative disaggregation, data movement and processing technologies improve computation flows, compute-storage flows, and in-storage compute — all of which are critical for the ability to adopt and grow AI solutions.”

NeuReality, which currently has 40 employees, plans to hire 20 more over the next two fiscal quarters. To date, it’s raised $48 million in venture capital.

More TechCrunch

Zoox, Amazon’s self-driving unit, is bringing its autonomous vehicles to more cities.  The self-driving technology company announced Wednesday plans to begin testing in Austin and Miami this summer. The two…

Zoox to test self-driving cars in Austin and Miami 

Called Stable Audio Open, the generative model takes a text description and outputs a recording up to 47 seconds in length.

Stability AI releases a sound generator

It’s not just instant-delivery startups that are struggling. Oda, the Norway-based online supermarket delivery startup, has confirmed layoffs of 150 jobs as it drastically scales back its expansion ambitions to…

SoftBank-backed grocery startup Oda lays off 150, resets focus on Norway and Sweden

Newsletter platform Substack is introducing the ability for writers to send videos to their subscribers via Chat, its direct messaging feature, the company announced on Wednesday. The rollout of video…

Substack brings video to its Chat feature

Hiya, folks, and welcome to TechCrunch’s inaugural AI newsletter. It’s truly a thrill to type those words — this one’s been long in the making, and we’re excited to finally…

This Week in AI: Ex-OpenAI staff call for safety and transparency

Ms. Rachel isn’t a household name, but if you spend a lot of time with toddlers, she might as well be a rockstar. She’s like Steve from Blues Clues for…

Cameo fumbles on Ms. Rachel fundraiser as fans receive credits instead of videos  

Cartwheel helps animators go from zero to basic movement, so creating a scene or character with elementary motions like taking a step, swatting a fly or sitting down is easier.

Cartwheel generates 3D animations from scratch to power up creators

The new tool, which is set to arrive in Wix’s app builder tool this week, guides users through a chatbot-like interface to understand the goals, intent and aesthetic of their…

Wix’s new tool taps AI to generate smartphone apps

ClickUp Knowledge Management combines a new wiki-like editor and with a new AI system that can also bring in data from Google Drive, Dropbox, Confluence, Figma and other sources.

ClickUp wants to take on Notion and Confluence with its new AI-based Knowledge Base

New York City, home to over 60,000 gig delivery workers, has been cracking down on cheap, uncertified e-bikes that have resulted in battery fires across the city.  Some e-bike providers…

Whizz wants to own the delivery e-bike subscription space, starting with NYC

This is the last major step before Starliner can be certified as an operational crew system, and the first Starliner mission is expected to launch in 2025. 

Boeing’s Starliner astronaut capsule is en route to the ISS 

TechCrunch Disrupt 2024 in San Francisco is the must-attend event for startup founders aiming to make their mark in the tech world. This year, founders have three exciting ways to…

Three ways founders can shine at TechCrunch Disrupt 2024

Google’s newest startup program, announced on Wednesday, aims to bring AI technology to the public sector. The newly launched “Google for Startups AI Academy: American Infrastructure” will offer participants hands-on…

Google’s new startup program focuses on bringing AI to public infrastructure

eBay’s newest AI feature allows sellers to replace image backgrounds with AI-generated backdrops. The tool is now available for iOS users in the U.S., U.K., and Germany. It’ll gradually roll…

eBay debuts AI-powered background tool to enhance product images

If you’re anything like me, you’ve tried every to-do list app and productivity system, only to find yourself giving up sooner than later because sooner than later, managing your productivity…

Hoop uses AI to automatically manage your to-do list

Asana is using its work graph to train LLMs with the goal of creating AI assistants that work alongside human employees in company workflows.

Asana introduces ‘AI teammates’ designed to work alongside human employees

Taloflow, an early stage startup changing the way companies evaluate and select software, has raised $1.3M in a seed round.

Taloflow puts AI to work on software vendor selection to reduce cost and save time

The startup is hoping its durable filters can make metals refining and battery recycling more efficient, too.

SiTration uses silicon wafers to reclaim critical minerals from mining waste

Spun out of Bosch, Dive wants to change how manufacturers use computer simulations by both using modern mathematical approaches and cloud computing.

Dive goes cloud-native for its computational fluid dynamics simulation service

The tension between incumbents and fintechs has existed for decades. But every once in a while, the two groups decide to put their competition aside and work together. In an…

When foes become friends: Capital One partners with fintech giants Stripe, Adyen to prevent fraud

After growing 500% year-over-year in the past year, Understory is now launching a product focused on the renewable energy sector.

Insurance provider Understory gets into renewable energy following $15M Series A

Ashkenazi will start her new role at Google’s parent company on July 31, after 23 years at Eli Lilly.

Alphabet brings on Eli Lilly’s Anat Ashkenazi as CFO

Tobiko aims to reimagine how teams work with data by offering a dbt-compatible data transformation platform.

With $21.8M in funding, Tobiko aims to build a modern data platform

In 1816, French physician René Laennec invented an instrument that allowed doctors to listen to the heart and lungs. That device — a stethoscope — eventually evolved from a simple…

Eko Health scores $41M to detect heart and lung disease earlier and more accurately

The number of satellites on low Earth orbit is poised to explode over the coming years as more mega-constellations come online. This will create new opportunities for bad actors to…

DARPA and Slingshot build system to detect ‘wolf in sheep’s clothing’ adversary satellites

SAP sees WalkMe’s focus on automating contextual, in-app support as bringing value to its own enterprise customers.

SAP to acquire digital adoption platform WalkMe for $1.5B

The National Democratic Alliance (NDA) has emerged victorious in India’s 2024 general election, but with a smaller majority compared to 2019. According to post-election analysis by Goldman Sachs, JPMorgan, CLSA,…

Modi-led coalition’s election win signals policy continuity in India — and spending cuts

Featured Article

A comprehensive list of 2024 tech layoffs

The tech layoff wave is still going strong in 2024. Following significant workforce reductions in 2022 and 2023, this year has already seen 60,000 job cuts across 254 companies, according to independent layoffs tracker Layoffs.fyi. Companies like Tesla, Amazon, Google, TikTok, Snap and Microsoft have conducted sizable layoffs in the…

21 hours ago
A comprehensive list of 2024 tech layoffs

Featured Article

What to expect from WWDC 2024: iOS 18, macOS 15 and so much AI

Apple is hoping to make WWDC 2024 memorable as it finally spells out its generative AI plans.

22 hours ago
What to expect from WWDC 2024: iOS 18, macOS 15 and so much AI

We just announced the breakout session winners last week. Now meet the roundtable sessions that really “rounded” out the competition for this year’s Disrupt 2024 audience choice program. With five…

The votes are in: Meet the Disrupt 2024 audience choice roundtable winners