Marc Andreessen & the "Sputnik" Moment

DeepSeek was not the start of this newest space race.

Jan 28, 2025

The continued dominance of DeepSeek coverage across the internet has led me to a tweet of MA on this being a “Sputnik moment”- something I wanted to interrogate a bit today.

(x.com/pmarca/status/1883640142591853011)

Don’t ignore causality- ChatGPT-3 was “Sputnik”

In October 1957, the Soviet Union launched Sputnik and the event triggered panic, a space race, and a reordering of global power dynamics. Today, MA claims that DeepSeek represents AI’s “Sputnik moment” . But this framing misidentifies the catalyst- the real Sputnik moment occurred two years earlier, in November 2022, when OpenAI released ChatGPT-3. For China, ChatGPT was the alarm bell; DeepSeek is the response.

The West’s reaction—a $500 billion infrastructure arms race—risks repeating the Soviet Union’s fatal error: overcommitting to a strategy of brute-force spending while adversaries innovate around constraints, and undermine hardware-centric leads.

Remember: ChatGPT-3 came out of nowhere

When ChatGPT-3 debuted, it wasn’t a product launch so much as an earthquake. For the first time, a U.S. company had demonstrated a generative AI system with near-human fluency, accessible to anyone with an internet connection. In China, the implications were immediate. State media warned of a “generational gap” in AI, while tech executives scrambled to replicate OpenAI’s breakthroughs. The CCP, already wary of U.S. tech dominance, moved some resources into domestic AI projects.

But China faced structural challenges. U.S. sanctions had restricted access to advanced Nvidia chips like the A100, forcing Chinese firms to rely on lower-performance alternatives. Training large language models under these constraints seemed impossible—until DeepSeek’s engineers turned limitation into advantage.

Innovation from Necessity

DeepSeek’s rise is a story of necessity-driven innovation. Founded in 2023 by Liang Wenfeng, a quantitative hedge fund manager, the company leveraged China’s stockpile of Nvidia H800 chips (sanction workaround chips) to train its models at a fraction of the cost of Western competitors. The paper they released goes through the exhaustive work they did in squeezing every ounce of performance out of those chips, and damn, the squeezing sure worked.

That wasn’t magic; it was efficiency. While OpenAI and Google poured billions into data centres and top-tier chips, DeepSeek optimized data usage and prioritized cost-effective reinforcement learning. The result? A model that performs comparably to ChatGPT-o1 at 3.65% of the cost per million outputs. As Marc Andreessen himself admitted, the implications are profound: “It’s a gift to the world”.

But to frame this as a “Sputnik moment” ignores causality. DeepSeek exists because ChatGPT exposed China’s vulnerability. The true parallel lies in 1957: ChatGPT was Sputnik; DeepSeek is the moon landing.

The Missile Gap Redux

In the 1950s, the Soviets fixated on a “missile gap” with uncle Sam; a perceived shortfall that later proved exaggerated. The panic fuelled a military-industrial buildup that distorted budgets and priorities. Today, OpenAI’s $500 billion “Stargate” project—a joint venture with SoftBank and Oracle to build AI data centers—risks replicating this dynamic.

The parallels are striking:

Hardware, Hardware, Hardware: Just as the Soviets stockpiled ICBMs, Western tech giants are racing to hoard Nvidia chips and build data centres. Nvidia’s valuation soared 200% in 18 months—until DeepSeek’s rise triggered a 17% single-day crash.
Neglect of Efficiency: OpenAI’s o1 model costs ~20–40x more to run than DeepSeek-R1 14. Yet Silicon Valley’s ethos—“scale at all costs”—assumes bigger models and pricier chips guarantee dominance.
Geopolitical Blind Spots: U.S. sanctions aimed to cripple China’s AI development. Instead, they spurred innovation with lower-end chips, and with a level playing field the real race will be AI diffusion, which China is positioned to crush the US at.

The lesson? Brute-force spending without efficiency gains is unsustainable. The Soviet Union collapsed in part because its economy couldn’t sustain military overreach. Similarly, OpenAI’s $500 billion bet assumes infinite growth in a finite world—ignoring energy constraints, chip shortages, and the democratizing force of open-source alternatives like DeepSeek.

Assumptions that might be wrong:

Hardware Equals Leadership: Nvidia’s crash shows that chip dominance is fragile. DeepSeek’s H800-trained models prove that a cracked infra team can make a massive difference. What else is out there to optimize?

Proprietary Models Win: OpenAI’s closed-source approach sure seems like the loser here. As Meta’s Yann LeCun noted, “Open-source models are surpassing proprietary ones”. DeepSeek is winning and giving away the recipes.

Scale Trumps Speed: DeepSeek’s rapid iteration is all about agility over scale. Meanwhile, U.S. firms are shackled by legacy infrastructure and investor expectations.

How fast will this go?

I don’t think DeepSeek is done yet. I think we’ll have another SotA model within two months, and they will push open-source models to ever-higher heights. These are the youthful, patriotic kids (like NASA in the 60’s) who are intent on winning the race.

So what’s to be done?

Embrace Open-Source: The lack of resources in open-source LLMs in the US is sad; not just compute but the entire stack. We should be investing and valorizing.
Prioritize Efficiency Over Scale: François Chollet was right when he made efficiency a key piece of ARC AGI benchmark. We need more things like this.
Rethink Sanctions: Export controls aimed at stifling China instead fuelled its ingenuity. I’m going to write more on this soon.

Marc Andreessen is right to call DeepSeek a wake-up call—but he’s diagnosing the wrong disease. The real crisis isn’t China’s rise; it’s the West’s failure to learn from its own history. We’re not the plucky underdogs this time around; we’re first out of the gate, with a lot to lose.

Discussion about this post

Ready for more?