Fine-tuning
Fine-tuning is the process of further training a pretrained model on your specific data, baking in style, format, or domain knowledge that's hard to achieve with prompting alone.
What it means
Three categories matter in 2026: full fine-tuning (rare for foundation models — too expensive), LoRA / PEFT (parameter-efficient, the standard), and RLHF / DPO (alignment fine-tuning). OpenAI, Anthropic, and Google all offer hosted fine-tuning APIs at $25-100 per million training tokens. Open-weight models (Llama, Qwen, DeepSeek) can be fine-tuned anywhere using libraries like Unsloth, Axolotl, or Hugging Face PEFT.
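The parameter savings behind LoRA can be sketched in a few lines. This is an illustrative example in plain NumPy, not a real training loop or any specific library's API: instead of updating a full weight matrix W, LoRA trains two small low-rank factors A and B and applies W + (alpha / r) * B @ A at inference time. All dimensions below are assumptions chosen for illustration.

```python
import numpy as np

# Toy LoRA sketch: one 4096x4096 layer, rank r = 8 (assumed values).
d_in, d_out, r, alpha = 4096, 4096, 8, 16

rng = np.random.default_rng(0)
W = rng.standard_normal((d_out, d_in)) * 0.01  # frozen pretrained weight
A = rng.standard_normal((r, d_in)) * 0.01      # trainable low-rank factor
B = np.zeros((d_out, r))                       # zero init: W_eff == W at step 0

# Effective weight used at inference; only A and B are ever updated.
W_eff = W + (alpha / r) * (B @ A)

full_params = W.size               # 16,777,216
lora_params = A.size + B.size      # 65,536
print(f"trainable fraction: {lora_params / full_params:.4%}")  # → 0.3906%
```

At rank 8, the trainable parameters are 1/256 of the full matrix, which is why LoRA adapters train cheaply and ship as small files.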
Why it matters
Most production teams skip fine-tuning until prompting and RAG hit a quality ceiling. Fine-tuning is the right move when you need a consistent format or style that few-shot examples can't achieve, when your domain uses terminology the base model doesn't know well, or when you're optimizing inference cost (a smaller fine-tuned model can outperform a prompt-engineered larger one).
Frequently asked questions
Fine-tuning vs RAG?
RAG retrieves facts at query time; fine-tuning bakes style, format, and terminology into the model itself. They complement each other, and serious products often use both.
Cost?
$25-100 per million training tokens on hosted APIs. Open-weight LoRA fine-tuning runs $50-500 in GPU time, depending on model size and dataset.
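The hosted-API arithmetic is simple: billed tokens are roughly dataset tokens times epochs. A minimal sketch (the function name and the example figures are assumptions for illustration; the $25/M rate is the low end of the range above, not any vendor's actual price list):

```python
def finetune_cost_usd(training_tokens: int, epochs: int, usd_per_million: float) -> float:
    """Estimate hosted fine-tuning cost: billed tokens = dataset tokens x epochs."""
    return training_tokens * epochs / 1_000_000 * usd_per_million

# e.g. a 2M-token dataset, 3 epochs, at $25 per million training tokens:
print(finetune_cost_usd(2_000_000, 3, 25.0))  # → 150.0
```

Multiplying by epochs matters: a modest dataset re-run for several passes can cost several times the single-pass estimate.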
Related terms
- RAG (Retrieval-Augmented Generation): augments an LLM with documents retrieved at query time — typically from a vector database. The LLM grounds its answer in the retrieved text instead of relying purely on training data.
- Context window: the maximum amount of text (in tokens) an AI model can process in a single request — combining your system prompt, conversation history, and output. Past the limit, the model can't 'see' earlier content.