R1 vs. o1: AI as Commodity or Moat

Metadata

US - China AI model enjoyers Dabbing Dude meme

DeepSeek announced the release of its R1 reasoning model[1] a day before the US announced it would use its executive power to secure the infrastructure needs for a joint venture between OpenAI (developers of the o1 reasoning model), Microsoft, Oracle, and Softbank, titled ā€œStargateā€[2].

A few days later, markets have taken this as an existential threat to OpenAIā€™s dominance and energy/compute demand. This is a shot across the bow from DeepSeek and zooming out, represents the economic tug-of-war between China and the US, but I donā€™t think it spells disaster for the USā€™ ambitions ā€“ if US companies can continue pushing the performance frontier.

NVDA price action on 2025-01-27
Nvidia was down ~17% by end of day on the news.

The Facts

DeepSeek trained R1 for an estimated $5.6MM on A100 and H800 GPUs (not the latest-and-greatest GPUs available). End-user inference pricing is an order of magnitude cheaper than o1.

Input/Output Pricing for o1-Class Inference Models ($/1M Tokens)
Source: @RnaudBertrand.

Despite less capital expenditure, R1 has comparable performance to o1 due to several technical innovations. I donā€™t pretend to deeply understand everything on this front, so instead Iā€™ll direct you to this paper, which breaks down whatā€™s happening under the hood. Ultimately, R1 is essentially a drop-in replacement for o1.

Accuracy / Percentile (%) by Benchmark
Source: DeepSeek-AI.

This is incredible for the little guy: run your own model, privately, without the need for massive infrastructure.

R1 running locally on an iPhone
A distilled version of R1 running locally on an iPhone. Source: @localghost.

Is OpenAI Cooked?

First, the threshold for publicly available, open-source reasoning models will never be lower than this. HuggingFace is attempting to reproduce R1 fully in the open[3] which would make it an end-to-end public frontier model. I think this is a win for society.

Second, everyone should be significantly shortening their anticipated AI development timelines. The initial market reaction was a first order reaction (80% less compute! 80% less chip demand!) but I donā€™t see why energy infrastructure and chip manufacturing demand would be negatively impacted. Iā€™d more likely guess this is a Jevonā€™s paradox situation, where the excess capacity is immediately saturated to advance model performance at a now more rapid pace.

Itā€™s clear with the amount of investment the US and companies are ploughing into OpenAI, their vision for the future is one where OpenAI has the highest horsepower model and defends its moat simply by having the smartest, fastest, best model. Value capture in this version of the world accrues to the model developers.

Chinaā€™s approach, exemplified by R1, envisions a future with commodity AI models used as building blocks within other applications. Incentivizing open development of models with comparable performance to private frontier models ensures the US canā€™t develop an AI moat. China codified this objective in 2017 with the release of their, ā€œNew Generation Artificial Intelligence Development Planā€ (translated). Some sections that stood out to me:

Open-Source and Open. Advocate the concept of open-source sharing, and promote the concept of industry, academia, research, and production units each innovating and in principal pursuing joint innovation and sharing. Follow the coordinated development law for economic and national defense construction; promote two-way conversion and application for military and civilian scientific and technological achievements and co-construction and sharing of military and civilian innovation resources; form an all-element, multi-domain, highly efficient new pattern of civil-military integration.[4]

(2) Optimize arrangements to build AI innovation bases Guide existing AI-related national focus laboratories, corporate national focus laboratories, national engineering laboratories, and other such bases, and conduct research focused on an advanced direction of a new generation of AI.[5]

I donā€™t think itā€™s a stretch to compare OpenAIā€™s business model with that of IBM in the 80s and 90s[6]. History shows tech advancements trend towards commoditization. Weā€™ll see if the USā€™ investments can buck the trend.


  1. šŸš€ DeepSeek-R1 is here! ā†©ļøŽ

  2. Announcing the Stargate Project ā†©ļøŽ

  3. Open R1 ā†©ļøŽ

  4. Full Translation: Chinaā€™s ā€˜New Generation Artificial Intelligence Development Planā€™ (2017) ā†©ļøŽ

  5. Full Translation: Chinaā€™s ā€˜New Generation Artificial Intelligence Development Planā€™ (2017) ā†©ļøŽ

  6. History of IBM - 1975ā€“1992: Information revolution, rise of software and PC industries ā†©ļøŽ