Reka AI's $110 Million Bet on Efficient, On-Premise Multimodal AI

The Singapore-based startup, backed by NVIDIA and Snowflake, is building frontier-level models designed to run inside enterprise firewalls.

By Bash Okafor

About Reka AI

Published 2026-07-03T15:05:59.399Z

Reka AI emerged from stealth in 2023 with a clear technical specification. Its models would be multimodal, processing text, images, video, and audio. They would be frontier-class, aiming for parity with the largest closed models. And they would be efficient, built to run on customer-controlled infrastructure with orders of magnitude lower compute [TechCrunch, June 2023]. Two years and $168 million in funding later, that specification has become a billion-dollar bet, with a recent $110 million Series B round led by NVIDIA and Snowflake valuing the company at $1 billion [SiliconANGLE, July 2025]. The wedge is not just capability, but deployability.

A research team turned commercial builders

The founding team reads like a roster from the leading labs of the last decade. CEO Dani Yogatama and Chief Scientist Yi Tay are alumni of DeepMind and Google Brain. Co-founders Cyprien de Masson d’Autume, Qi Liu, and Mikel Artetxe bring experience from Meta’s FAIR and other top research institutions [TechCrunch, June 2023]. This pedigree is a direct answer to the primary challenge in the frontier model race: the sheer research and engineering talent required to keep pace. Reka’s initial capital, a combined $108 million seed round in 2023, was essentially a bet that this group could translate academic-grade research into a commercial product [TechCrunch, June 2023] [Reuters, June 2023].

The product wedge: efficiency as a feature

While competitors like OpenAI and Anthropic compete on raw benchmark performance via API, Reka’s differentiation is architectural. The company builds a suite of models, from the flagship 21-billion parameter Reka Flash to the larger Reka Core, all trained from scratch for multimodal understanding [arXiv, 2024]. The key claim is that these models deliver comparable quality to much larger models while being small and efficient enough for on-premise or private cloud deployment [TechCrunch, June 2023]. This is not a stripped-down version of a giant model; it is a core design constraint.

The product suite is organized around this efficient core:

Reka Flash. A 21-billion parameter multimodal model serving as the workhorse, with publicly tracked performance improvements on the LMSYS Chatbot Arena [reka.ai, 2026].
Reka Core. The frontier-class foundation model for applications requiring maximum capability.
Reka Vision & Research. Specialized platforms built on Flash for video analysis and complex, agentic research workflows [growthengineer.ai].
Yasa. The initial commercial product, an AI assistant trained to derive insights from a company’s internal, multimodal data [TechCrunch, June 2023].

Strategic validation from infrastructure giants

Reka’s enterprise-focused, deployable model thesis has attracted more than capital; it has drawn strategic partners from the infrastructure layer. The Series B lead investors, NVIDIA and Snowflake, are not passive financial backers [finance.yahoo.com, July 2025]. NVIDIA’s involvement signals validation of the model efficiency and training approach on its hardware stack. Snowflake’s investment points directly to the data layer, where Reka’s models could become a native AI engine for the vast proprietary datasets stored in the Data Cloud. A separate partnership with Oracle, selecting OCI as a preferred cloud provider for training and serving, further cements this infrastructure-aligned strategy [aiwiki.ai].

Seed Round (2023) | 108 | M USD
Series B (2025) | 110 | M USD

The scale-up challenge

The technical premise is compelling for regulated industries, government agencies, and any enterprise with sensitive data that cannot leave its perimeter. The model efficiency allows for a total cost of ownership that undercuts running a massive API-centric model at scale. However, the path from a technically elegant model to a dominant enterprise platform is fraught.

The primary risk is the renewal motion. Enterprise AI contracts are often won on a specific, high-value pilot project. The long-term enterprise platform sale requires a different muscle: deep integration, robust tooling for model management and monitoring, and a customer success engine that proves continuous value. Reka’s team is dense with world-class researchers, but the public record shows less experience in building and scaling a global enterprise sales and support organization capable of competing with the established outreach of Google, Microsoft, and Amazon.

Furthermore, efficiency is a moving target. The larger closed models are also relentlessly optimizing their cost per token. If the performance gap widens significantly, the efficiency argument weakens. Reka must continue to advance its core research while simultaneously building the commercial machinery of a billion-dollar software company,a dual-track challenge that has derailed many deep-tech startups.

The next twelve months

With the new capital, the immediate milestones are clear. The company must transition from flagship model demonstrations to published, large-scale enterprise deployments. Landing a marquee customer in a tightly regulated sector like finance or healthcare would serve as a powerful proof point for the on-premise value proposition. Technically, maintaining the performance trajectory of Reka Flash on public benchmarks like LMSYS will be critical to validating the “frontier-class” claim against ever-improving competitors.

The partnership with Snowflake is the strategic lever to watch. If Reka can become the default, efficient multimodal AI option within the Snowflake ecosystem, it gains a distribution channel that bypasses traditional enterprise sales cycles. The next year will show whether this is a loose partnership or a deeply integrated technical roadmap.

For infrastructure teams evaluating multimodal AI, the technical breakdown is straightforward. Reka offers a family of models where the size-to-performance ratio is the primary innovation. The Flash 3.1 model, for instance, acts as a base for both the Vision and Research agent platforms, suggesting a unified, efficient core [reka.ai, 2026]. The sober assessment is that this architecture is ideal for controlled environments but untested at the scale of thousands of concurrent enterprise users, each with custom fine-tuned variants. The failure mode at scale isn’t the model failing, but the surrounding platform,the orchestration, security, and management layer,buckling under operational complexity. Reka’s $110 million is a bet that they can build that platform before the efficiency advantage evaporates.

Sources

[TechCrunch, June 2023] Reka emerges from stealth to build custom AI models for the enterprise | https://techcrunch.com/2023/06/27/reka-emerges-from-stealth-to-build-custom-ai-models-for-the-enterprise/
[SiliconANGLE, July 2025] Multimodal AI startup Reka AI raises $110M at $1B valuation | https://siliconangle.com/2025/07/22/multimodal-ai-startup-reka-ai-raises-110m-1b-valuation/
[Reuters, June 2023] AI model startup Reka raises $50 million led by DST Global Partners | https://www.reuters.com/technology/ai-model-startup-reka-raises-50-mln-led-by-dst-global-2023-06-27/
[finance.yahoo.com, July 2025] Reka AI raises $110M Series B led by NVIDIA, Snowflake | https://finance.yahoo.com/news/reka-ai-raises-110m-series-120000000.html
[arXiv, 2024] Reka Core, Flash, and Edge: Multimodal Language Models | https://arxiv.org/html/2404.12387v1
[reka.ai, 2026] Reka Flash updates and performance | https://reka.ai/news/reka-flash-updates
[growthengineer.ai] Reka AI company and product overview | https://growthengineer.ai/startups/reka
[aiwiki.ai] Reka AI partnership with Oracle | https://aiwiki.ai/wiki/Reka_AI

Read on Startuply.vc