Featured Article

Is there a future in light-powered AI chips?

Photonics is proving to be a tough nut to crack

Comment

Central Computer Processor digital technology and innovations
Image Credits: Olemedia / Getty Images

The growing compute power necessary to train sophisticated AI models such as OpenAI’s ChatGPT might eventually run up against a wall with mainstream chip technologies.

In a 2019 analysis, OpenAI found that from 1959 to 2012, the amount of power used to train AI models doubled every two years, and that the power usage began rising seven times faster after 2012.

It’s already causing strain. Microsoft is reportedly facing an internal shortage of the server hardware needed to run its AI, and the scarcity is driving prices up. CNBC, speaking to analysts and technologists, estimates the current cost of training a ChatGPT-like model from scratch to be over $4 million.

One solution to the AI training dilemma that’s been proposed is photonic chips, which use light to send signals rather than the electricity that conventional processors use. Photonic chips could in theory lead to higher training performance because light produces less heat than electricity, can travel faster and is far less susceptible to changes in temperature and electromagnetic fields.

LightmatterLightOn, Luminous Computing, Intel and NTT are among the companies developing photonic technologies. But while the technology generated much excitement a few years ago — and attracted a lot of investment — the sector has cooled noticeably since then.

There are various reasons why, but the general message from investors and analysts studying photonics is that photonic chips for AI, while promising, aren’t the panacea they were once believed to be.

“One important insight for us has been that, for data center applications — and this is where most people think photonic AI compute would likely be applied — nobody is interested in inference or training alone,” said Christian Patze, the investment director at M Ventures, over email. M Ventures backed Celestial AI, one of the companies developing light-based AI accelerator chips. “A new technology, be it photonic or electronic, has to be able to work equally well for both applications, otherwise it’s not of interest for large customers in the space.”

Inference — that is, running trained models — is fairly expensive. According to one estimate, ChatGPT was costing OpenAI $100,000 per day at its peak, which works out to an eye-watering $3 million per month.

But on a more fundamental level, Patze isn’t convinced that “photonic compute” approaches are the best possible option for training. Photonic chips are physically larger than their electronic counterparts and difficult to mass-produce, for one, owing to the immaturity of photonic chip fabrication plants. Moreover, photonic architectures still largely rely on electronic control circuits, which can create bottlenecks.

“Where it concerns photonic chips, it very much depends on what the algorithm and workload is for the training and how those algorithms can be mapped onto photonic compute architectures,” Patze said. “People missed that AI compute is a lot more than doing [certain operations] efficiently and that mapping advanced AI models onto theoretically fast optical compute engines is not trivial — or rather very hard.”

The other issue with photonics chip tech, according to Patze, is the power required to convert data into a format that the chips can work with. With conventional, silicon-based hardware, data is transferred and stored in the digital domain, whereas in photonics, it’s analog. Converting between digital and analog necessitates dedicated converter components, which tend to be power hungry.

There’s yet another unresolved challenge in photonics: signal regeneration, or the process of regenerating optical signals degraded during transmission through photonic chips. Most chips perform signal regeneration through “optical to electrical to optical” conversion, in which a weak and distorted optical signal is detected and restored in a digital, electronic piece of hardware and then retransmitted to the optical domain. But this causes the signal to become distorted over time, leading to issues downstream.

For these reasons and others, Gartner analyst Anushree Verma, who studies emerging technologies and trends, believes that it’ll take more than eight years for photonic computing to become mainstream.

“If we think of an all-optical computing system, it needs a variety of components such as optical gates, optical switches, optical memories, optical interconnects and light sources,” Verma said. “Commercial availability of the critical components for photonic technology is still very far away for various reasons … The customer concentration as well as the technological challenges are not particularly hospitable to entrants.”

So is there a future in photonics for AI yet? Probably. It might just require a reframing.

Patze sees photonics helping to overcome one of the big bottlenecks in AI compute: delivering speedy access to large amounts of memory. For large AI models like OpenAI’s GPT-4, AI accelerator cards — think chips like Google’s TPUs or AWS’ Trainium — in server racks usually have tens or hundreds of cores and hundreds of gigabytes of memory that need to be interlinked. When strained with data, these classical electrical interconnects start to fail or become quite slow. But photonic interconnects don’t; they have far more capacity.

“Longer-distance connections at high bandwidth and low latency is exactly where photonic interconnects shine,” Patze said. “Hence, it’s not surprising that some companies are turning to those to overcome the data movement bottleneck in modern AI compute. The photonics interconnect approach also has much shorter development timelines, as photonic interconnect components and circuit design in principle are well known from the optical communication industry.”

Verma sees the photonics AI accelerator chip industry evolving in three stages over the next several years. First, she expects a wave of hybrid connectivity — silicon and photonics packaged together — followed by a flood of integrated platforms (the second phase), and then eventually full stack photonic computing as the third stage.

By 2027, Verma thinks that 10% of network switch deployments will have co-packaging optics, driven by the need for increasing bandwidth and reducing power consumption, up from a negligible percentage now.

“The primary challenge is the manufacturing cost reduction,” Verma said, “but the market will be driven by the need of increasing bandwidth and reducing power consumption, up from negligible now … Photonic computing is still in its very early stage.”

To her point, Mordor Intelligence predicts that the photonic integrated circuit market will be worth $26.42 billion by 2027.

More TechCrunch

The families of victims of the shooting at Robb Elementary School in Uvalde, Texas are suing Activision and Meta, as well as gun manufacturer Daniel Defense. The families bringing the…

Families of Uvalde shooting victims sue Activision and Meta

Like most Silicon Valley VCs, what Garry Tan sees is opportunities for new, huge, lucrative businesses.

Y Combinator’s Garry Tan supports some AI regulation but warns against AI monopolies

Everything in society can feel geared toward optimization – whether that’s standardized testing or artificial intelligence algorithms. We’re taught to know what outcome you want to achieve, and find the…

How Maven’s AI-run ‘serendipity network’ can make social media interesting again

Miriam Vogel, profiled as part of TechCrunch’s Women in AI series, is the CEO of the nonprofit responsible AI advocacy organization EqualAI.

Women in AI: Miriam Vogel stresses the need for responsible AI

Google has been taking heat for some of the inaccurate, funny, and downright weird answers that it’s been providing via AI Overviews in search. AI Overviews are the AI-generated search…

What are Google’s AI Overviews good for?

When it comes to the world of venture-backed startups, some issues are universal, and some are very dependent on where the startups and its backers are located. It’s something we…

The ups and downs of investing in Europe, with VCs Saul Klein and Raluca Ragab

Welcome back to TechCrunch’s Week in Review — TechCrunch’s newsletter recapping the week’s biggest news. Want it in your inbox every Saturday? Sign up here. OpenAI announced this week that…

Scarlett Johansson brought receipts to the OpenAI controversy

Accurate weather forecasts are critical to industries like agriculture, and they’re also important to help prevent and mitigate harm from inclement weather events or natural disasters. But getting forecasts right…

Deal Dive: Can blockchain make weather forecasts better? WeatherXM thinks so

pcTattletale’s website was briefly defaced and contained links containing files from the spyware maker’s servers, before going offline.

Spyware app pcTattletale was hacked and its website defaced

Featured Article

Synapse, backed by a16z, has collapsed, and 10 million consumers could be hurt

Synapse’s bankruptcy shows just how treacherous things are for the often-interdependent fintech world when one key player hits trouble. 

1 day ago
Synapse, backed by a16z, has collapsed, and 10 million consumers could be hurt

Sarah Myers West, profiled as part of TechCrunch’s Women in AI series, is managing director at the AI Now institute.

Women in AI: Sarah Myers West says we should ask, ‘Why build AI at all?’

Keeping up with an industry as fast-moving as AI is a tall order. So until an AI can do it for you, here’s a handy roundup of recent stories in the world…

This Week in AI: OpenAI and publishers are partners of convenience

Evan, a high school sophomore from Houston, was stuck on a calculus problem. He pulled up Answer AI on his iPhone, snapped a photo of the problem from his Advanced…

AI tutors are quietly changing how kids in the US study, and the leading apps are from China

Welcome to Startups Weekly — Haje‘s weekly recap of everything you can’t miss from the world of startups. Sign up here to get it in your inbox every Friday. Well,…

Startups Weekly: Drama at Techstars. Drama in AI. Drama everywhere.

Last year’s investor dreams of a strong 2024 IPO pipeline have faded, if not fully disappeared, as we approach the halfway point of the year. 2024 delivered four venture-backed tech…

From Plaid to Figma, here are the startups that are likely — or definitely — not having IPOs this year

Federal safety regulators have discovered nine more incidents that raise questions about the safety of Waymo’s self-driving vehicles operating in Phoenix and San Francisco.  The National Highway Traffic Safety Administration…

Feds add nine more incidents to Waymo robotaxi investigation

Terra One’s pitch deck has a few wins, but also a few misses. Here’s how to fix that.

Pitch Deck Teardown: Terra One’s $7.5M Seed deck

Chinasa T. Okolo researches AI policy and governance in the Global South.

Women in AI: Chinasa T. Okolo researches AI’s impact on the Global South

TechCrunch Disrupt takes place on October 28–30 in San Francisco. While the event is a few months away, the deadline to secure your early-bird tickets and save up to $800…

Disrupt 2024 early-bird tickets fly away next Friday

Another week, and another round of crazy cash injections and valuations emerged from the AI realm. DeepL, an AI language translation startup, raised $300 million on a $2 billion valuation;…

Big tech companies are plowing money into AI startups, which could help them dodge antitrust concerns

If raised, this new fund, the firm’s third, would be its largest to date.

Harlem Capital is raising a $150 million fund

About half a million patients have been notified so far, but the number of affected individuals is likely far higher.

US pharma giant Cencora says Americans’ health information stolen in data breach

Attention, tech enthusiasts and startup supporters! The final countdown is here: Today is the last day to cast your vote for the TechCrunch Disrupt 2024 Audience Choice program. Voting closes…

Last day to vote for TC Disrupt 2024 Audience Choice program

Featured Article

Signal’s Meredith Whittaker on the Telegram security clash and the ‘edge lords’ at OpenAI 

Among other things, Whittaker is concerned about the concentration of power in the five main social media platforms.

2 days ago
Signal’s Meredith Whittaker on the Telegram security clash and the ‘edge lords’ at OpenAI 

Lucid Motors is laying off about 400 employees, or roughly 6% of its workforce, as part of a restructuring ahead of the launch of its first electric SUV later this…

Lucid Motors slashes 400 jobs ahead of crucial SUV launch

Google is investing nearly $350 million in Flipkart, becoming the latest high-profile name to back the Walmart-owned Indian e-commerce startup. The Android-maker will also provide Flipkart with cloud offerings as…

Google invests $350 million in Indian e-commerce giant Flipkart

A Jio Financial unit plans to purchase customer premises equipment and telecom gear worth $4.32 billion from Reliance Retail.

Jio Financial unit to buy $4.32B of telecom gear from Reliance Retail

Foursquare, the location-focused outfit that in 2020 merged with Factual, another location-focused outfit, is joining the parade of companies to make cuts to one of its biggest cost centers –…

Foursquare just laid off 105 employees

“Running with scissors is a cardio exercise that can increase your heart rate and require concentration and focus,” says Google’s new AI search feature. “Some say it can also improve…

Using memes, social media users have become red teams for half-baked AI features

The European Space Agency selected two companies on Wednesday to advance designs of a cargo spacecraft that could establish the continent’s first sovereign access to space.  The two awardees, major…

ESA prepares for the post-ISS era, selects The Exploration Company, Thales Alenia to develop cargo spacecraft