growth11 min readMay 21, 2026

How Hugging Face Became the GitHub of AI on a Failed Chatbot's Ashes

Hugging Face spent three years failing as a teen chatbot, then shipped a 600-line PyTorch port that became the namespace every NLP paper imports. The substrate playbook — and why you can't reopen the window.

TL;DR

Hugging Face spent roughly three years failing as a teen chatbot — a few thousand users, $5.2M raised on a thesis no investor believed — before the pivot that mattered: a PyTorch port of Google's BERT shipped within about a week of BERT's release.
That open-source library, transformers, became the namespace every NLP paper in the world imports. The company rode it to $130M revenue and profitability by 2024 — on just $59M of substrate-period burn — and a $4.5B valuation with no priced round since August 2023.
The Series D pulled in eight major tech-platform strategics in a single round — Salesforce, Google, Amazon, Nvidia, AMD, Intel, IBM, Qualcomm — several of whom compete directly. The conflict was the point: it sold neutrality.

The Numbers

Date	Revenue / ARR	Valuation	Milestone
2016	$0	—	Founded in NYC; teen-chatbot product
Oct 2018	$0	—	PyTorch BERT port ships — the real pivot
Dec 2019	—	~$15M	Series A; "definitive NLP library"
Mar 2021	—	~$40M	Series B; cash-flow positive disclosed
May 2022	~$10M ARR	$2B	Series C; "GitHub of machine learning"
Aug 2023	~$50-70M ARR	$4.5B	Series D; eight strategic investors
Dec 2024	~$130M revenue	$4.5B (no new round)	Profitability declared

Revenue figures are Sacra/Contrary/Latka triangulated estimates; the profitability claim is a public statement from CEO Clément Delangue. Three-and-a-half years of substrate-period burn (October 2018 to May 2022) totaled $59M raised — cheap by comparison, because open-source libraries have no inference cost and the academic audience evangelizes for free.

What they did differently

Move 1: Ship the substrate the moment a research field needs a unifying API

Hugging Face is the only major AI case where the company spent its first three years failing as a consumer product. Three French co-founders — Clément Delangue (CEO), Julien Chaumond (CTO), Thomas Wolf (CSO) — incorporated in NYC in 2016 and built an "AI best friend" iOS chatbot for teenagers. The MIT Tech Review diary from March 2017 reported a user base of a few thousand teens — not the millions later retrospectives sometimes claim. The company raised $1.2M angel and $4M seed on the chatbot thesis.

The decision that mattered during the chatbot era was a hire, not a feature. Wolf was brought on as a research scientist, building NLP infrastructure to power the chatbot — but in PyTorch, with academic discipline, against the literature. When Google released BERT in October 2018 in TensorFlow, the gap was structural: most academic NLP researchers preferred PyTorch. Wolf and a small team shipped a PyTorch port — pytorch-pretrained-bert — within roughly a week. Within months it was the way most NLP papers reported using BERT. The chatbot did not pivot in October 2018. The substrate did. The December 2019 Series A was merely the formal recognition of a shift that had already happened twelve months earlier.

The September 2019 v2.0.0 release brought the canonical transformers name, and the October 2019 arXiv paper by Wolf et al. formalized academic credibility. Every grad student who learned NLP after 2019 learned it through the Hugging Face API.

Move 2: Turn the library into a platform — make the URL the distribution

By 2021 the library was enough; the strategic question was what to build around it. The answer was the Hub: model hosting, then dataset hosting, then huggingface.co/spaces in October 2021 — free hosting for ML demos, with Gradio as the standard SDK. In December 2021 Hugging Face acquired Gradio outright — a five-engineer startup that was about to shut down.

This was the narrative upgrade in motion. The company's framing shifted from "the open-source NLP library" to "the GitHub of machine learning," and the Series C of $100M at a $2B valuation in May 2022 was the first time the platform framing carried a multiple. Spaces mattered for a reason most analyses miss: a model card is a static artifact, but a Space is a working web app anyone can fork, modify and rerun. When Stable Diffusion released in August 2022, it went viral on Hugging Face Spaces before it went viral anywhere else — the canonical demo Spaces were forked thousands of times.

The structural advantage here does not transfer cleanly. The company name is the library name is the URL is the audience: importing from transformers import AutoModel is reaching into Hugging Face's namespace. Every arXiv paper that cites a huggingface.co model URL is a long-tail distribution push, and every grad student who replicates that experiment learns the URL pattern by the time they have buying power. In a four-year academic cycle, the substrate compounds at the rate of grad-student turnover — a flywheel a normal SaaS landing page cannot manufacture.

Move 3: Sell neutrality by taking checks from rivals at once

The Series D of $235M at a $4.5B valuation in August 2023 was Hugging Face's commercial coronation, and the investor list was the headline: Salesforce Ventures led, with Google, Amazon, Nvidia, AMD, Intel, IBM and Qualcomm all participating.

Eight major tech-platform strategics in one round — and several compete with each other directly. Google and Amazon are cloud rivals. Nvidia, AMD and Intel are silicon rivals. As CNBC put it:

Google, Amazon, Nvidia, AMD and other tech giants invest in Hugging Face.

— CNBC headline, August 24, 2023

The conflict was the point. The thesis Hugging Face is selling is neutrality — to be the open substrate where every model from every organization runs, you need to look unambiguously not-aligned with any single hyperscaler. The way to look not-aligned is to take checks from all of them at once. It is a financial-engineering move with a GTM payload: each strategic becomes an executive-level customer, an integration partner, and a co-marketing pipeline. The neutrality framing is also why there has been no priced primary round since — at $130M revenue and profitable, Hugging Face has not needed one.

What you can copy

Watch for the hire that becomes the pivot. Hugging Face's turning point was a research-scientist hire who built the substrate in the right framework. The capability that saves you may already be inside the company — staffed against the wrong product.
Ship into the gap the moment a field shifts. When a major release lands in the wrong framework for its core audience, a fast, disciplined port can capture mindshare in months. Speed against a structural gap beats polish.
Make your URL the distribution. Architect the product so users reference your URL pattern in their own work — papers, READMEs, blog posts. Then the citation is the marketing, and it compounds without spend.
Build a forkable demo format. A static artifact gets viewed; a working app that others can duplicate and rerun gets propagated. Spaces let one viral model become thousands of forks overnight.
Use a conflicted investor stack to sell neutrality. If your value depends on being unaligned, taking checks from competing strategics at once is the strongest possible signal of independence — and each one becomes a customer and channel.

What probably won't work for you

The playbook is reusable, but five preconditions kept it from being available to most teams.

The 2018–2019 BERT/PyTorch window cannot be reopened. It was a unique convergence — a research field shifting toward transformers, Google releasing in TensorFlow when most academics preferred PyTorch, no incumbent open-source library, and a research community small enough that one good port captured mindshare in months. A 2026 founder cannot open this play in NLP. Whether it can be opened in robotics is the live question Hugging Face's own LeRobot library is testing.

Academic credibility is not transferable. Thomas Wolf has been publishing under the Hugging Face affiliation since 2019, anchoring the academic leg of a triangulated founder-IP surface — Delangue on daily X presence, Wolf on papers and keynotes, Chaumond on infrastructure. A solo non-research founder cannot fake this, and hiring a chief scientist three years in does not produce the same effect.

Patient capital funded a non-category. Lux Capital led three consecutive rounds (Series A–C) before open-source AI infrastructure was even a venture category. A 2026 founder pitching the same thesis faces investors who already know how the outcome looks and price accordingly.

Bicultural French-American operations gave a structural edge. It opened access to French government funding (the Jean Zay supercomputer, the BLOOM project) and French research talent — plus the dual-Atlantic posture that supports the neutrality framing. And the honest limits matter: net dollar retention is undisclosed, the inference-vs-subscription margin split is opaque, and the library is literally named transformers — if state-space models or diffusion-native LLMs displace transformers, the substrate has to be retrofitted, which is harder than building it the first time.

Sources & references

Hugging Face wants to become your artificial BFF — TechCrunch
Hugging Face raises $15M to build the definitive NLP library — TechCrunch
Hugging Face raises $40M Series B — TechCrunch
Hugging Face reaches $2B valuation to build the GitHub of ML — TechCrunch
Hugging Face raises $235M from Salesforce, Google, Amazon, Nvidia and more — TechCrunch
Tech giants invest in Hugging Face — CNBC
HuggingFace's Transformers: State-of-the-art NLP — arXiv (Wolf et al.)
Hugging Face's chief science officer worries AI is becoming yes-men on servers — TechCrunch
Full timeline: The Hugging Face Growth Story — every round, acquisition, and milestone, charted

This case study is part of GrowthHunt's growth teardown series. For another substrate-first compounding story, read the Clay teardown; for the AI-rocket opposite, the Lovable teardown. Track the fastest-growing AI repos and founders live on GrowthHunt Velocity.

Keep reading

See it run in the wild.

Growth Story12 min

How Cursor Got to $1B ARR in 13 Months: Growth Teardown

Read the cursor story →

Growth Story12 min

How Lovable Hit $400M ARR in 15 Months: Growth Playbook

Read the lovable story →

Want this automated?

GrowthHunt does this for you.

Six free tools are live right now — no waitlist, no card, just click in.

Explore the live tools →