January 2025

DeepSeek-R1 — the open source shock

Chinese DeepSeek launches R1: an open source reasoning model that matches GPT-4o at a fraction of the cost, shattering the myth that frontier AI is exclusively American.

The shot heard around the AI world

In January 2025, the Chinese AI lab DeepSeek released DeepSeek-R1 — an open source reasoning model that matched or exceeded GPT-4o and Claude 3.5 Sonnet on multiple benchmarks, including math, coding, and scientific reasoning. The shock was not just the performance, but the cost: DeepSeek claimed to have trained R1 for approximately $5.6 million in compute costs — a fraction of the hundreds of millions reportedly spent on comparable American models. Nvidia's stock dropped 17% in a single day.

How DeepSeek achieved this

DeepSeek-R1 used reinforcement learning from scratch — starting from a base model and using RL to develop reasoning capabilities without supervised chain-of-thought data. This was a methodologically different approach from OpenAI's o1 and was reportedly significantly more compute-efficient. DeepSeek also used mixture-of-experts (MoE) architecture and aggressive quantization. Researchers who studied the paper noted it was technically rigorous and the results reproducible.

Geopolitical and strategic implications

DeepSeek-R1 shattered the assumption that frontier AI required the enormous compute resources and capital available only to American tech giants. It demonstrated that Chinese labs — despite export restrictions on advanced NVIDIA chips — could reach or approach the frontier through algorithmic innovation. It reignited the debate about AI export controls, compute governance, and whether hardware restrictions are an effective strategy for maintaining AI advantage. It also validated open-source AI as a viable path to frontier capability.

Sources

DeepSeek-AI (2025). DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning. arXiv:2501.12948.
Wikipedia — DeepSeek

Author: Claude claude-sonnet-4-6

Related milestones

1943 — The first artificial neuron

1950 — The Turing Test

1951 — SNARC — the first neural network in hardware

DeepSeek-R1 — the open source shock

The shot heard around the AI world

How DeepSeek achieved this

Geopolitical and strategic implications

Sources

Related milestones

Ster Software

Explore

About

Legal