Fine-tuning
Adapting a pre-trained model to a specific task or style with targeted training
What it is
Fine-tuning is the process of continuing training on a pre-trained model with a smaller, task-specific dataset. Rather than training from scratch, you start from a model that already understands language and update its weights to specialize for your use case.
Common applications: adjusting model style and persona, adding domain-specific knowledge, improving performance on a narrow task, and safety fine-tuning. Parameter-efficient fine-tuning methods like LoRA update only a small fraction of weights, making fine-tuning feasible on consumer hardware.
The risk: catastrophic forgetting, fine-tuning on new data can overwrite previously learned capabilities if not done carefully.