2006

The deep learning breakthrough

Geoffrey Hinton publishes a method to effectively train deep neural networks via pre-training, reopening research into neural networks.

Hinton and the resurrection of neural networks

In 2006, Geoffrey Hinton and his colleagues at the University of Toronto published two papers that would transform the field of machine learning. In A Fast Learning Algorithm for Deep Belief Nets (with Simon Osindero and Yee-Whye Teh) and a companion piece in Science, Hinton demonstrated that deep neural networks — networks with many hidden layers — could be effectively trained using a two-phase approach: unsupervised pre-training layer by layer, followed by supervised fine-tuning. This solved the vanishing gradient problem that had made training deep networks practically infeasible.

Why it mattered

Since Minsky and Papert's Perceptrons (1969) and the second AI winter, neural networks had been largely marginalized in favor of support vector machines and other statistical methods. Hinton's 2006 papers showed that depth — many layers of nonlinear transformations — enabled networks to learn increasingly abstract representations of data. This was the theoretical justification for what would become known as deep learning. Hinton coined the term himself and for years was largely alone in his conviction that this approach would transform AI.

Path to AlexNet and beyond

The 2006 breakthrough did not immediately dominate the field — that took until 2012, when AlexNet won the ImageNet competition by a massive margin. But it reopened research into neural networks, attracted new PhD students, and laid the groundwork for the decade of breakthroughs to come. Hinton, Yann LeCun, and Yoshua Bengio — the trio often called the "Godfathers of Deep Learning" — received the Turing Award in 2018 for their contributions.


Sources

  • Hinton, G.E. et al. (2006). A Fast Learning Algorithm for Deep Belief Nets. Neural Computation, 18(7), 1527–1554.
  • LeCun, Y., Bengio, Y. & Hinton, G. (2015). Deep learning. Nature, 521, 436–444.
  • Wikipedia — Deep learning

Ster Software

The most complete knowledge platform on artificial intelligence.

Kraaienjagersweg 24
7341 PT Beemte Broekland, Netherlands


© 2026 Ster Software BV · Chamber of Commerce 75474913

Content generated by Claude (Anthropic) · model: claude-sonnet-4-6

This website is built with Obelisk MCP Services by Ster Software.