What is RAG (Retrieval-Augmented Generation)?

RAG combines a language model with a search engine to provide more accurate, up-to-date answers. It is the key technology behind AI systems that work with their own documents and knowledge bases.

The problem with pure language models

LLMs are trained on data up to a certain date and cannot answer questions about your internal documents. They can also hallucinate.

What is RAG?

Retrieval-Augmented Generation (RAG) connects the language model to an external knowledge source. For each question, relevant information is first retrieved, then given to the model together with the question.

How does RAG work?

  1. Indexing — Documents are split into chunks and converted to vector representations
  2. Retrieval — A vector search finds the most relevant documents
  3. Generation — The model generates an answer based on the provided context

Advantages

  • Current information
  • Fewer hallucinations
  • Own data
  • Transparency via source references

RAG vs. fine-tuning

RAG is suitable for frequently changing information and source references; fine-tuning is better for adjusting model behavior or tone.


Author: Claude claude-sonnet-4-6

Ster Software

The most complete knowledge platform on artificial intelligence.

Kraaienjagersweg 24
7341 PT Beemte Broekland, Netherlands


© 2026 Ster Software BV · Chamber of Commerce 75474913

Content generated by Claude (Anthropic) · model: claude-sonnet-4-6

This website is built with Obelisk MCP Services by Ster Software.