SFT (Supervised Fine-Tuning)
- Definition
- The process of training a pre-trained model on a curated dataset of input-output examples that demonstrate the desired behavior. SFT is typically the first alignment step after pre-training, teaching the model to follow instructions and produce useful responses.
- Why it matters
- SFT is the workhorse of model customization. While RLHF and DPO get more attention, SFT is where most of the behavioral shaping happens. The quality and diversity of your SFT dataset directly determines your model's instruction-following ability, domain knowledge, and output style. For enterprises building custom models, SFT is usually the highest-ROI investment: a few thousand high-quality examples can transform a generic base model into a domain expert. The key insight is that SFT example quality matters far more than quantity: 1,000 expert-written examples typically outperform 100,000 crowd-sourced ones.
- In practice
- The standard post-training pipeline is: SFT first (teaching instruction following), then RLHF or DPO (refining preferences and safety). Meta's Llama 3 used approximately 10M SFT examples across multiple rounds. OpenAI's fine-tuning API is essentially a managed SFT service. The open-source community uses datasets like OpenHermes, Alpaca, and ShareGPT for SFT. Key practices include: diverse task coverage (instruction following, conversation, analysis, coding), high-quality examples (expert-written, not auto-generated), and careful decontamination (ensuring evaluation data does not appear in training data). Companies like Scale AI provide custom SFT dataset creation services.
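The core mechanic of SFT described above — training on input-output pairs so the model learns to produce the desired response — can be sketched at the data level. A minimal illustration, assuming a toy setup with made-up token IDs (no real tokenizer or model): the prompt and response are concatenated, and the prompt positions are typically masked out of the loss (PyTorch's cross-entropy convention uses -100 as the ignore label) so the model is only penalized on the response it should learn to produce.

```python
# Illustrative sketch of SFT label construction (toy token IDs, no real model).
# Convention: positions labeled IGNORE_INDEX are excluded from the loss,
# so gradient signal comes only from the response tokens.

IGNORE_INDEX = -100  # PyTorch CrossEntropyLoss default ignore_index

def build_sft_labels(prompt_tokens, response_tokens):
    """Concatenate prompt + response; mask prompt positions out of the loss."""
    input_ids = list(prompt_tokens) + list(response_tokens)
    labels = [IGNORE_INDEX] * len(prompt_tokens) + list(response_tokens)
    return input_ids, labels

# Pretend token IDs for a prompt ("Translate: hello") and response ("bonjour")
prompt = [101, 5, 17, 42]
response = [99, 7]
ids, labels = build_sft_labels(prompt, response)
```

The design choice here is the loss mask: without it, the model would also be trained to reproduce prompts, which wastes capacity and can degrade instruction following.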
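The decontamination practice mentioned above is often implemented as an n-gram overlap check between training and evaluation data. A minimal sketch, assuming a word-level 8-gram match as the contamination criterion (the window size and matching rules vary between labs):

```python
# Minimal n-gram decontamination check (illustrative; real pipelines
# normalize text more aggressively and tune n per benchmark).

def ngrams(text, n=8):
    """Return the set of word-level n-grams in a text."""
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}

def is_contaminated(train_example, eval_texts, n=8):
    """Flag a training example if it shares any n-gram with the eval set."""
    eval_grams = set()
    for t in eval_texts:
        eval_grams |= ngrams(t, n)
    return bool(ngrams(train_example, n) & eval_grams)
```

Flagged examples are dropped from the SFT set before training, so benchmark scores measure generalization rather than memorization.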
Related terms
Fine-tuning
The process of continuing to train a pre-trained model on a smaller, task-specific dataset. Fine-tuning customizes model behavior for specific domains or formats and is a key part of most enterprise AI deployments.
Reinforcement Learning from Human Feedback (RLHF)
A training technique where human raters rank model outputs, and the model learns to prefer higher-ranked responses. RLHF is what makes AI assistants helpful, harmless, and conversational rather than just autocomplete.
DPO (Direct Preference Optimization)
A training technique that aligns language models with human preferences by directly optimizing on preference data, without needing a separate reward model. DPO simplifies the RLHF pipeline while achieving comparable alignment quality.
Pre-training
The initial phase of model training where the network learns general knowledge from a massive dataset. Pre-training is the most expensive phase, often costing tens or hundreds of millions of dollars for frontier models.