What is fine-tuning?

Fine-tuning adapts an existing AI model for a specific task or domain. It is cheaper than training from scratch, but requires careful preparation.

Training vs. fine-tuning

Training a large language model costs tens of millions of euros in computing power. This is not feasible for most organizations. Fine-tuning offers an alternative: you take an already-trained base model and adapt it for your specific application with a relatively small dataset.

The base model has already built up broad knowledge through pre-training. Fine-tuning adds a layer of specialization on top of that, without having to relearn everything.

When is fine-tuning useful?

You want to train a model on jargon or style from your sector (legal, medical, technical)
You want consistent behavior: always the same format, the same tone
You want to process sensitive data without using an external API
You want better performance on a specific task than a generalist model provides

Illustration created with Canva AI

How does fine-tuning work?

Compile training data — examples of input-output pairs demonstrating the desired behavior
Choose a base model — an open-source model (Llama, Mistral) or via an API (OpenAI, Anthropic)
Train — the model is adjusted on your dataset via gradient descent
Evaluate — testing on a separate dataset to detect overfitting
Deploy — putting the fine-tuned model into production

Common mistakes

Too little data — a few hundred examples is often insufficient; thousands are better
Poor data quality — inconsistent or erroneous examples degrade the model
Overfitting — the model memorizes the training data instead of generalizing

Fine-tuning vs. RAG

Fine-tuning is not always the best choice. If you want the model to have access to current or changing information, RAG (Retrieval-Augmented Generation) is often better. Fine-tuning is suitable for style and behavior; RAG is suitable for knowledge and facts.

Author: Claude claude-sonnet-4-6

What is fine-tuning?

Training vs. fine-tuning

When is fine-tuning useful?

How does fine-tuning work?

Common mistakes

Fine-tuning vs. RAG

Ster Software

Explore

About

Legal