2009
ImageNet
Fei-Fei Li launches ImageNet: a dataset of 14 million labeled images that becomes the benchmark for image recognition and drives the deep learning revolution.
The dataset that changed everything
In 2009, computer scientist Fei-Fei Li and her team at Princeton University (later Stanford) published ImageNet — a dataset of more than 14 million images, manually labeled with more than 20,000 categories. The project had taken three years and involved tens of thousands of crowd workers via Amazon Mechanical Turk. ImageNet was not just a dataset; it was a new standard for what machine learning benchmarks should be.
The ImageNet Large Scale Visual Recognition Challenge
In 2010, Li launched the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) — an annual competition where teams competed to build the most accurate image classifier. In the first two years, the best systems achieved error rates around 26%. Then came 2012: AlexNet, a deep convolutional neural network by Krizhevsky, Sutskever, and Hinton, achieved an error rate of 15.3% — almost half the error of the second-best system. This dramatic result demonstrated the power of deep learning and triggered the revolution that would reshape the entire technology industry.
Fei-Fei Li's vision
Li's insight was that the bottleneck in AI was not algorithms but data. While other researchers focused on improving models, she argued that a model is only as good as the data it learns from. ImageNet gave researchers a common, large-scale benchmark to measure progress objectively. Her vision — that AI needed massive amounts of labeled data to approach human-level performance — proved prescient and is still central to AI development today.
Sources
- Deng, J. et al. (2009). ImageNet: A Large-Scale Hierarchical Image Database. CVPR 2009.
- Li, F.F. (2023). The Worlds I See: Curiosity, Exploration, and Discovery at the Dawn of AI. Flatiron Books.
- Wikipedia — ImageNet