What is Scale AI?
The data foundry for AI — training data, model evaluation, and government AI applications, now ~49% owned by Meta.
- Category
- AI data infrastructure
- Headquarters
- San Francisco, CA
- Founded
- 2016
- Employees
- ~1,300 (post-layoff)
- 2025 revenue
- ~$2B (doubled YoY)
- Valuation
- ~$29B (Meta, Jun 2025)
What is Scale AI?
Scale AI is the largest independent 'data foundry' for artificial intelligence — it labels, generates, and curates the training data, human feedback, and model evaluations used to build and grade frontier AI models. Founded in 2016, it grew from autonomous-vehicle image labeling into a core data supplier for OpenAI, Google, Microsoft, Meta, and the U.S. government, then in June 2025 sold a roughly 49% non-voting stake to Meta for $14.3 billion at a ~$29 billion valuation.
Scale runs a vertically integrated platform plus a managed network of human annotators and subject-matter experts, organized into three lines: Build AI (the Scale Data Engine for generative AI, government, and automotive), Apply AI (the Scale GenAI Platform and the Donovan defense product), and Evaluate AI (Scale Evaluation for benchmarking and red-teaming models). In practice, customers pay Scale to produce the high-quality labeled and human-feedback data that makes large language models accurate, safe, and aligned, and increasingly to deploy and evaluate those models in production.
The business scaled with the LLM boom and then accelerated through the Meta deal. Revenue rose from roughly $870 million in 2024 to about $2 billion in 2025 — more than doubling — on the back of over $1 billion in new bookings, nearly half of which closed in Q4 2025. Customers cluster into three segments: frontier model labs, the U.S. government (including a $249 million Department of Defense engagement and two nine-figure awards in Q3 2025), and enterprises, with new 2025 logos including Mayo Clinic, BP, and Allianz.
Meta's June 2025 investment reshaped the company. Founder-CEO Alexandr Wang left to lead Meta's Superintelligence Labs, Jason Droege stepped in as interim CEO, and several rival labs — Google, OpenAI, Microsoft, and xAI — began pulling work over data-leakage and neutrality concerns, prompting a 14% workforce cut in July 2025. Scale remains independent and is repositioning around enterprise, government, and model-evaluation demand under a 'reliability' mantra.
What does Scale AI offer?
Scale sells training-data generation, human feedback (RLHF), model evaluation, and applied/government AI products, grouped into Build AI, Apply AI, and Evaluate AI.
- Scale Data Engine· Build AI
- Generative AI data (RLHF / SFT)· Build AI
- Data labeling & annotation· Build AI
- Autonomous-vehicle / sensor data· Build AI
- Synthetic data generation· Build AI
- Scale GenAI Platform· Apply AI
- Scale Donovan (defense AI)· Apply AI
- Scale Evaluation· Evaluate AI
- Model benchmarking & red-teaming· Evaluate AI
- Expert / human-feedback network· Services
- Public sector & government AI· Government
How does Scale AI make money?
Scale makes money by charging for data work and AI services — per-task/per-unit pricing for self-serve labeling and custom enterprise and government contracts for large training-data, RLHF, evaluation, and applied-AI programs. It marks up a managed network of human labor, yielding an estimated 50%+ gross margin.
At the low end, Scale runs a pay-as-you-go, per-unit model: historically about 2¢ per image and 6¢ per annotation, with free starter tiers (the first ~1,000 labeling units or ~10,000 images free) to seed self-serve usage. The economics come from arbitrage — Scale programmatically routes tasks to a global pool of annotators and experts, marks up the labor, and expands margin as pre-labeling AI lets each contractor label more per hour — producing roughly 50%+ gross margins (below pure-software norms because of the heavy human-service component).
The real revenue is enterprise and government: multi-million-dollar, volume-committed contracts for frontier-model training data, human feedback, evaluation, and increasingly applied-AI deployments. Google was reportedly Scale's largest customer at roughly $200 million per year, and Scale holds a $249 million U.S. Department of Defense engagement plus two nine-figure government awards booked in Q3 2025 — a sign that large, recurring, services-heavy deals (not credit-card-style usage) drive the bulk of the top line.
Growth historically tracked the frontier-lab spend race on data and human feedback. After Meta's stake triggered rival labs to pull work, Scale's growth thesis shifted toward government, enterprise 'Apply AI' deployments (Mayo Clinic, BP, Allianz), and model evaluation — markets where neutrality, reliability, and security clearances matter more than a competitor's ownership. 2025 revenue still roughly doubled to ~$2 billion on more than $1 billion in new bookings, showing the rebuild outran the customer churn.
Who leads Scale AI?
Scale was founded by Alexandr Wang and Lucy Guo in 2016. Since June 2025 it has been led by interim CEO Jason Droege, after Wang left to run Meta's Superintelligence Labs (he remains a Scale board director).
- Jason DroegeInterim Chief Executive OfficerCEO since June 2025 (joined Aug 2024 as Chief Strategy Officer)Ex-Uber Eats founder and Axon executive; still holds the interim title in 2026 and is leading Scale's post-Meta restructuring toward enterprise, government, and evaluation under a 'reliability' mantra.
- Alexandr WangCo-founder & Board Director (former CEO)Co-founder 2016; CEO 2016–2025MIT dropout who built Scale into an industry standard; left in June 2025 to lead Meta's Superintelligence Labs as its Chief AI Officer but stays on Scale's board.
- Lucy GuoCo-founder (departed 2018)Co-founder 2016Carnegie Mellon dropout; left in 2018 but kept a ~5% stake worth ~$1.25B at the Meta valuation, making her (per Forbes) the world's youngest self-made woman billionaire. Now CEO of creator-monetization startup Passes.
- Dennis CinelliChief Financial OfficerCFOOwns finance, capital structure, and the budget approvals that gate large enterprise software and infrastructure purchases.
How do you contact Scale AI's leadership?
Scale AI's email pattern is First.Last@scale.com (e.g. jane.doe@scale.com), the dominant format on its primary domain. The addresses below follow that verified pattern; they are not individually published, so treat them as format-derived rather than confirmed personal inboxes. For media, Scale routes through press@scale.com.
jane.doe@scale.comHow much funding has Scale AI raised?
Scale AI raised about $1.6 billion in venture equity across roughly six priced rounds from 2016 to 2024, reaching a ~$13.8 billion valuation. In June 2025 Meta invested $14.3 billion for a ~49% non-voting stake, valuing Scale at about $29 billion.
The early rounds were classic Silicon Valley: a 2016 Y Combinator seed, a $4.5 million Series A in May 2017 led by Accel, and an $18 million Series B in August 2018 led by Index Ventures. Scale became a unicorn with its $100 million Series C in August 2019, led by Peter Thiel's Founders Fund at a $1 billion valuation, with Coatue, Index, Spark, and Thrive participating.
The LLM era accelerated everything. A $155 million Series D in December 2020, led by Tiger Global, valued Scale at about $3.5 billion; a $325 million Series E in April 2021 (Dragoneer and Greenoaks, with Tiger) roughly doubled that to ~$7 billion. In May 2024, Accel led a $1 billion Series F that nearly doubled the valuation again to ~$13.8 billion, with NVIDIA, Amazon, Meta, Y Combinator, Thrive, Coatue, and Spark participating. In early 2025 the company also floated a tender offer near a $25 billion valuation to give shareholders liquidity.
The defining event was June 2025: Meta paid $14.3 billion for a ~49% non-voting stake, valuing Scale at roughly $29 billion and giving early backers like Accel a large cash return. The deal was structured as an investment rather than an acquisition so Scale could remain independent; founder Alexandr Wang and senior researchers moved to Meta as part of the arrangement.
How did Scale AI get here?
From a 2016 autonomous-vehicle labeling startup to a ~$29B AI data foundry doing ~$2B in 2025 revenue, capped by Meta's $14.3B stake and a leadership change in 2025.
- 2016Founded in San FranciscoAlexandr Wang (MIT dropout) and Lucy Guo (CMU dropout) start Scale via Y Combinator, initially labeling sensor data for self-driving cars.
- Aug 2019Becomes a unicorn$100M Series C led by Founders Fund (Peter Thiel) at a $1B valuation; customers expand beyond AV to OpenAI, Airbnb, and Lyft.
- Apr 2021$7B valuation$325M Series E led by Dragoneer and Greenoaks with Tiger Global, riding early generative-AI demand.
- May 2024Series F at ~$13.8B$1B round led by Accel; NVIDIA, Amazon, and Meta participate as the LLM data race peaks; revenue exits the year at a ~$1.5B run rate.
- Jun 2025Meta invests $14.3B for ~49%Values Scale at ~$29B; founder Alexandr Wang departs to lead Meta's Superintelligence Labs, and Jason Droege becomes interim CEO.
- Jul 202514% layoffs amid customer exitsScale cuts ~200 employees and 500 contractors as Google, OpenAI, Microsoft, and xAI pull work over data-leakage and neutrality concerns.
- 2025Revenue roughly doubles to ~$2BOver $1B in new bookings (nearly half in Q4); new enterprise logos include Mayo Clinic, BP, and Allianz, and government books two nine-figure awards.
Who are Scale AI's competitors?
Scale competes with a new wave of expert-data and RLHF providers (Surge AI, Mercor, Micro1) and with annotation/data-platform players (Labelbox, Snorkel AI, Appen) — many of which gained share after Meta's stake pushed labs to diversify suppliers.
- Surge AIBootstrapped RLHF and human-feedback rival reportedly larger than Scale by revenue (~$1.2B in 2024); raising its first outside capital (~$1B) at a $15–25B valuation.
- MercorFast-growing expert-marketplace for AI training data (~$450–500M ARR); raised a $350M Series C in Oct 2025 at a ~$10B valuation and is a top beneficiary of labs leaving Scale.
- LabelboxData-labeling and annotation platform emphasizing software-first tooling and an on-demand labeling workforce.
- Snorkel AIProgrammatic data-labeling and development platform — write labeling functions instead of annotating point-by-point.
- AppenPublic, long-established global crowd-data provider for training and evaluation across languages and modalities.
- Micro1AI-native vetting and expert-labeling startup positioning as a leaner, automation-heavy Scale alternative.
Scale AI — frequently asked questions
