dictionary

Fine-Tuning (PEFT, LoRA)

Fine-tuning adjusts the internal weights of a model so it performs better on a targeted task, like writing SQL queries or understanding medical terminology. Parameter-Efficient Fine-Tuning (PEFT) and Low-Rank Adaptation (LoRA) are modern techniques that allow developers to fine-tune massive models quickly and cheaply without retraining every parameter.

CategoryModels

Reading time4 min read

Last updatedFeb 19, 2025

Definition

The process of taking a pre-trained model and training it further on a smaller, specialized dataset to adapt it to specific tasks or domains.

Need this applied?

We help teams go from definitions to deployed workflows—safely and fast.

Start a project Book a strategy call

When to fine-tune

OpenAI Hugging Face

Fine-tuning is best used when you need to change the behavior, tone, or specific formatting of a model (e.g., generating JSON output perfectly every time). If you simply need the model to know new information, Retrieval-Augmented Generation (RAG) is usually faster and more reliable.

PEFT and LoRA

Hugging Face

Instead of updating all billions of parameters, PEFT/LoRA techniques freeze the original model and only train a small "adapter" module. This makes fine-tuning accessible to teams without massive GPU clusters.

FAQ

Is fine-tuning the same as RAG?

No. Fine-tuning bakes knowledge into the model’s weights, like teaching someone to speak Spanish. RAG gives the model an open textbook at query time without changing its weights.

OpenAI

Email this summary + checklist

Get a copy of “Fine-Tuning (PEFT, LoRA)” and an AI readiness checklist in your inbox.