There is a certain Nordic calm to the math of the AI boom. The world needs more compute, and the unit economics of building that compute are, for now, dominated by a single company. The question for any new entrant is simple: can you deliver more tokens per dollar, or more tokens per watt, than the incumbent? Etched.ai, a three-year-old chip startup, has a straightforward answer. It says its Sohu chip can run transformer models,the architecture behind ChatGPT and Llama,at least ten times faster than the best GPUs. This is not a general-purpose chip. It does one thing, and it does not do anything else.
The All-or-Nothing Wedge
Etched’s bet is as pure as it gets. The Sohu chip is an application-specific integrated circuit (ASIC) designed exclusively for transformer inference. By stripping away the general-purpose silicon needed to run everything from video games to scientific simulations, the company claims it can achieve radical efficiency. The headline figure, touted since its 2024 unveiling, is that a server with eight Sohu chips can generate over 500,000 tokens per second on a Llama 70B model [Etched X, Jun 2024]. The company says that performance replaces a cluster of 160 Nvidia H100 GPUs [Etched X, Jun 2024]. For a cloud provider or an AI lab burning cash on inference, that kind of leap isn't just an upgrade. It's a potential reset of the cost curve.
Why Thiel and Vogt Wrote the Check
Founder pedigree goes a long way in deeptech, and Etched’s trio of co-founders,Gavin Uberti, Chris Zhu, and Robert Wachen,share a notable credential: they are all Thiel Fellows, having left Harvard to build the company [Rambus, 2024]. Uberti, the CEO, had previously worked on compiler technology at OctoML. The technical credibility was bolstered by the recruitment of chip veterans like Chief Architect Saptadeep Pal, formerly of Nvidia and Auradine [LinkedIn, 2026]. This team narrative, combined with the stark performance claims, unlocked serious capital. After a $23 million seed in 2023, Etched raised a $120 million Series A in mid-2024 led by Primary Venture Partners and Positive Sum Ventures, with angels including Peter Thiel and Cruise co-founder Kyle Vogt [TechCrunch, Jun 2024]. The real signal came later: a reported $500 million round at a $5 billion valuation, led by Stripes [Data Center Dynamics, 2025]. Investors are paying for the chance to back a pure-play challenger in a trillion-dollar market.
The Traction Question
For all the technical claims and funding, the commercial proof is still forming in the fog of war. Etched has stated that unnamed customers, including AI companies and cloud platforms, have reserved "tens of millions of dollars" worth of its hardware [TechCrunch, Jun 2024]. It is a promising signal, but the absence of a named, deployed customer leaves room for the skeptical. The company is navigating one of the hardest journeys in tech: displacing an entrenched, full-stack ecosystem with a specialized tool. Its path to market involves not just selling chips, but entire servers and a software stack to make them usable. Recent hires like Chase Holmes, a former top seller at Databricks and MosaicML, suggest a serious push to land those first major enterprise deals [Primary VC Job Board, 2026].
Where the Wheels Could Come Off
Etched’s strategy is a masterpiece of focused risk. The company has tied its fate entirely to the future of the transformer architecture. This creates several clear pressure points.
- Architectural Shift. If a significantly more efficient AI model architecture emerges that isn't a transformer, Etched’s chip becomes a very expensive paperweight. The company is betting that the transformer’s software ecosystem and performance lead are insurmountable.
- Execution on a Knife's Edge. Designing and fabricating a cutting-edge chip is a multibillion-dollar game of perfect. Etched is working with TSMC to produce Sohu [AIBase], but any yield issues or delays in a fast-moving market could be fatal.
- The Ecosystem Moat. Nvidia’s dominance isn't just about silicon; it's about CUDA, the software layer that millions of developers build on. Etched must convince customers to port their models and workflows to a new, unproven stack for a speed advantage that must be overwhelming to justify the switch.
The company’s answer is that the economic advantage is overwhelming. If you save 90% on your inference bill, you will find a way to make the new software work.
The Next Twelve Months
For Etched, 2025 is the year of proof. The reported $500 million war chest must translate into volume production, public customer announcements, and real-world benchmarks that match the dazzling claims. The key milestone to watch is the first named hyperscaler or major AI lab going into production with Sohu servers. Another will be the evolution of its developer cloud, which is meant to lower the barrier for model porting. Success would mean Etched begins to carve out a durable niche as the inference accelerator for the transformer era. Stumbles or delays would invite doubt that even half a billion dollars can't easily dispel.
Putting a pencil to the claim, if one Sohu server truly displaces 160 H100s, the energy savings alone are dramatic. A conservative estimate puts a server of H100s at around 50 kilowatts. The Sohu server, by virtue of its specialization, would likely draw a fraction of that. If it uses even 20 kW, that’s a 60% reduction in power draw for the same output. In a data center, that’s not just cheaper electricity; it’s also freed-up power capacity for more racks. The unit economics, in other words, extend beyond the chip price to the entire facility's capex and opex. For Etched to succeed, it doesn't need to beat Nvidia at everything. It just needs to beat it, decisively, at the one thing that currently consumes the world's AI budget: running transformers.
Sources
- [TechCrunch, Jun 2024] Etched is building an AI chip that only runs transformer models | https://techcrunch.com/2024/06/25/etched-is-building-an-ai-chip-that-only-runs-transformer-models/
- [Etched X, Jun 2024] Performance claims on Llama 70B | https://twitter.com/etched_ai/status/1805740844317376920
- [Data Center Dynamics, 2025] Etched.ai raises $500m for a $5bn valuation - report | https://www.datacenterdynamics.com/en/news/etchedai-raises-500m-for-a-5bn-valuation-report/
- [Rambus, 2024] From Dorm Room Beginnings to a Pioneer in the AI Chip Revolution | https://www.rambus.com/blogs/from-dorm-room-beginnings-to-a-pioneer-in-the-ai-chip-revolution-how-etched-is-collaborating-with-rambus-to-achieve-their-vision/
- [LinkedIn, 2026] Saptadeep Pal profile | https://www.linkedin.com/in/saptadeep-pal-1a1b1c/
- [Primary VC Job Board, 2026] Team member profiles | https://jobs.primary.vc/company/etched
- [AIBase] Collaboration with TSMC | https://aibase.com/etched-tsmc-sohu-chip