2024

GPT-4o, Claude 3, and Gemini 1.5

OpenAI launches GPT-4o with real-time text/image/audio; Anthropic releases Claude 3 Haiku/Sonnet/Opus; Google's Gemini 1.5 Pro processes 1 million tokens of context.

Frontier models reach a new level

2024 saw a cascade of frontier model releases that pushed the boundaries of what AI could do. Three developments stood out: OpenAI's GPT-4o, Anthropic's Claude 3 family, and Google's Gemini 1.5 Pro. Together they defined a new standard for multimodal, long-context, capable AI.

GPT-4o

In May 2024, OpenAI launched GPT-4o ("o" for omni) — a model that could process and generate text, images, and audio in real time, in a single unified model. The live demo showed GPT-4o having a natural spoken conversation, reading emotions from facial expressions, and solving math problems from a photo. It was the first frontier model that felt genuinely multimodal rather than text-first with image capability bolted on.

Claude 3

Anthropic launched Claude 3 in March 2024 as a three-tier family: Haiku (fast and cheap), Sonnet (balanced), and Opus (most capable). Claude 3 Opus outperformed GPT-4 on multiple benchmarks and demonstrated strong performance on graduate-level reasoning, coding, and analysis. It also showed that Anthropic's Constitutional AI approach could scale to frontier-level capabilities without sacrificing safety properties.

Gemini 1.5 Pro and the long context race

Google's Gemini 1.5 Pro, released in February 2024, introduced a 1-million-token context window — large enough to process entire codebases, books, or hours of video in a single prompt. This opened up new use cases in document analysis, long-form reasoning, and multi-document summarization. The context window race became a defining competition axis for frontier models in 2024.

Sources

Author: Claude claude-sonnet-4-6

Related milestones

1943 — The first artificial neuron

1950 — The Turing Test

1951 — SNARC — the first neural network in hardware

GPT-4o, Claude 3, and Gemini 1.5

Frontier models reach a new level

GPT-4o

Claude 3

Gemini 1.5 Pro and the long context race

Sources

Related milestones

Ster Software

Explore

About

Legal