Hive's 700,000 Gig Workers Train the AI That Scans the Super Bowl

The content moderation specialist, backed by General Catalyst and 8VC, is now seeking $200 million at a reported $4 billion valuation.

About Hive

Published

Hive AI processes billions of API requests every month for hundreds of companies, but its most distinctive asset is not a server rack. It is a crowd. The San Francisco-based company uses a network of roughly 700,000 gig workers, accessed through its Hive Work app, to label the data that trains its proprietary machine learning models [PERPLEXITY SONAR PRO BRIEF, Oct 2021]. That human-powered engine is the wedge for a suite of enterprise APIs aimed at a single, high-stakes problem: understanding everything users post online.

Founded in 2017 by Kevin Guo and Dmitriy Karpman, Hive sells machine learning as a service. Its models classify images, video, text, and audio for harmful content, brand safety, and synthetic media. The company’s content moderation tools are a backbone for livestreaming platforms, its contextual advertising models are used by NBC Universal and Vevo, and its deepfake detection technology has drawn a partnership with the Pentagon’s Defense Innovation Unit [Hive (artificial intelligence company) - Wikipedia, Unknown], [Announcing Hive’s Partnership with the Defense Innovation Unit - Hive, Unknown]. At full utilization, its systems can handle 2-4 million completed tasks per day [Documentation | Hive, Unknown].

Now, Hive is reportedly seeking a new round to scale the operation. According to 2023 reports, the company is aiming to raise $200 million at a valuation of up to $4 billion [Hive AI Seeks to Raise $200 Million at $4 Billion Valuation | PYMNTS.com, 2023]. For investors like General Catalyst and 8VC, the bet is that a massive, proprietary training dataset,curated by that distributed workforce,creates a moat too wide for cloud giants to cross casually.

The data factory advantage

Every large AI company needs training data. Hive’s differentiator is owning the factory. While competitors like Google Cloud Vision or Amazon Rekognition use vast internal datasets, Hive’s approach is explicitly outsourced and scaled. The Hive Work app turns data labeling into micro-tasks completed by a global pool of contractors. This structure allows the company to rapidly adapt models to new forms of harmful content, a critical need for social platforms and marketplaces.

The output is a portfolio of pre-trained, production-ready APIs. Enterprise customers, primarily those with massive volumes of user-generated content, can integrate these models without building their own labeling pipelines or ML teams.

  • Content moderation. Automated detection of nudity, violence, hate speech, and self-harm imagery across all media types remains a core offering [PERPLEXITY SONAR PRO BRIEF, Unknown].
  • Specialized safety. A dedicated product for detecting child sexual abuse material (CSAM) is highlighted prominently on its site [PERPLEXITY SONAR PRO BRIEF, Unknown].
  • Synthetic media defense. APIs to detect deepfakes and AI-generated artwork address a growing threat for media companies and platforms [PERPLEXITY SONAR PRO BRIEF, Unknown].
  • Generative expansion. The company also offers image generation models, including SDXL and Flux Schnell variants, positioning itself across the full content lifecycle [October, 2024 - Blog & Insights | Hive, Oct 2024].

Scaling the enterprise pipeline

Hive’s traction is measured in API calls and brand names. The company says it serves "billions of customer API requests every month" and works with "hundreds of companies" [PERPLEXITY SONAR PRO BRIEF, Unknown]. Third-party data suggests over 584 companies used Hive AI as of 2025 [PERPLEXITY SONAR PRO BRIEF, Unknown]. Its models have been deployed at scale during major live events like the Super Bowl and March Madness [Hive (artificial intelligence company) - Wikipedia, Unknown].

The technical infrastructure is built for high throughput. Default rate limits allow 25-50 tasks per second, a concurrency level designed for the largest social and media platforms [Documentation | Hive, Unknown]. This performance claim is central to its enterprise sales motion, targeting customers for whom moderation latency directly impacts user experience and regulatory risk.

Key Investor Notable Focus
General Catalyst Growth-stage technology platforms
8VC Frontier tech and data-centric businesses
Tuas Capital Partners Undisclosed

The founder calculus

CEO Kevin Guo and CTO Dmitriy Karpman, both Forbes 30 Under 30 alumni in 2020, built Hive around a data-centric thesis [Dmitriy Karpman, 29, and Kevin Guo, 28 - 2019-12-03 - 2020 30 Under 30: Media, 2019]. Guo’s background spans finance, a prior role as Co-Chairman at Chinese P2P lender Dianrong.com, and an AI research internship at NASA [Kevin Guo, Dianrong.com: Profile and Biography - Bloomberg Markets, 2026], [Kevin Guo - AI/ML Research Intern - NASA Ames Research Center | LinkedIn, 2026]. Karpman brings the technical leadership to productize the research.

Their combined experience points toward a company engineered for commercial deployment, not pure research. The focus on turnkey applications, especially for content moderation and security, suggests a pragmatic understanding of what enterprises will pay for today: risk reduction.

Where the model could stall

Hive’s reliance on a crowdsourced workforce presents a dual-edged sword. The scale is an asset, but managing ethical labor practices, quality consistency, and cost dynamics across 700,000 gig workers introduces operational complexity that purely automated or in-house labeled competitors do not face. Any significant disruption to this labor pool or escalation in its cost could pressure margins.

Furthermore, the competitive landscape is dense with well-capitalized incumbents. Amazon, Google, and Microsoft offer their own vision and moderation APIs, often bundled deeply into broader cloud contracts. Hive’s answer is specificity and performance,its models are trained on the exact types of problematic content its customers need to catch, and its systems are tuned for the concurrency of a global livestream. The question is whether that specialization is enough to prevent the cloud giants from simply replicating the dataset over time.

The next funding inflection

The reported $200 million fundraise would mark a significant inflection. A $4 billion valuation, if achieved, would place Hive in the upper tier of specialized AI infrastructure companies. The capital would likely fuel further vertical expansion beyond its strongholds in social media and streaming, potentially into adjacent regulated industries like financial services compliance or healthcare content review.

The company’s partnership with the Defense Innovation Unit is a signal in that direction, proving its models meet the stringent requirements of government security contracts [Announcing Hive’s Partnership with the Defense Innovation Unit - Hive, Unknown]. For investors like General Catalyst and 8VC, the next check is a bet that Hive’s human-in-the-loop data engine can defend its territory against both cloud hyperscalers and a new wave of AI startups. Can a workforce of 700,000 train a model that stays ahead of everything the internet creates next?

Sources

  1. [Documentation | Hive, Unknown] Hive API documentation on rate limits and throughput | https://docs.thehive.ai/
  2. [Hive AI Seeks to Raise $200 Million at $4 Billion Valuation | PYMNTS.com, 2023] Funding round report | https://www.pymnts.com/
  3. [Hive (artificial intelligence company) - Wikipedia, Unknown] Company history and customer use cases | https://en.wikipedia.org/wiki/Hive_(artificial_intelligence_company)
  4. [Announcing Hive’s Partnership with the Defense Innovation Unit - Hive, Unknown] DIU partnership announcement | https://thehive.ai/blog/announcing-hives-partnership-with-the-defense-innovation-unit
  5. [October, 2024 - Blog & Insights | Hive, Oct 2024] Image generation model updates | https://thehive.ai/blog/2024/10
  6. [Dmitriy Karpman, 29, and Kevin Guo, 28 - 2019-12-03 - 2020 30 Under 30: Media, 2019] Forbes 30 Under 30 profile | https://www.forbes.com/profile/dmitriy-karpman-and-kevin-guo/
  7. [Kevin Guo, Dianrong.com: Profile and Biography - Bloomberg Markets, 2026] Kevin Guo's background at Dianrong | https://www.bloomberg.com/profile/person/21556695
  8. [Kevin Guo - AI/ML Research Intern - NASA Ames Research Center | LinkedIn, 2026] Kevin Guo's NASA internship | https://www.linkedin.com/in/kevin-guo-

Read on Startuply.vc