Pre-training
- Definition
- The initial phase of model training where the network learns general knowledge from a massive dataset. Pre-training is the most expensive phase, often costing tens or hundreds of millions of dollars for frontier models.
- Why it matters
- Pre-training is where a model's fundamental capabilities are established. Everything that follows (fine-tuning, alignment, deployment) builds on top of what was learned during pre-training. The cost and scale of pre-training create a natural oligopoly: only organizations with budgets of $100M or more and access to thousands of GPUs can train frontier models from scratch, making pre-training a barrier to entry that shapes the entire competitive landscape. Understanding pre-training also explains why fine-tuning works: the model has already learned language, reasoning, and world knowledge during pre-training, so fine-tuning only needs to steer those existing capabilities toward specific tasks.
- In practice
- GPT-4's pre-training reportedly cost over $100M and ran for several months on tens of thousands of GPUs. Llama 3 405B was trained on 15.6 trillion tokens using 16,000 H100 GPUs. Anthropic's Claude models use similarly massive pre-training runs. The pre-training process involves a simple objective (predicting the next token) applied at enormous scale. The dataset composition (web text, books, code, academic papers) heavily influences the model's strengths and weaknesses: weighting the mix toward code tends to improve reasoning, while more scientific text improves technical knowledge. The full pre-training recipe, including the data mix, learning rate schedule, and architectural choices, is a closely guarded secret at every lab.
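The next-token objective described above is just cross-entropy loss on the true next token. A toy sketch in plain Python (an illustration of the objective, not any lab's actual training code; the vocabulary and scores are made up):

```python
import math

def next_token_loss(logits, target_id):
    """Cross-entropy loss for one next-token prediction.

    logits: raw model scores over the vocabulary at one position.
    target_id: index of the token that actually came next in the corpus.
    """
    # Softmax normalization (subtract the max for numerical stability).
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    prob_target = exps[target_id] / sum(exps)
    # Training minimizes -log p(next token | context),
    # averaged over trillions of positions.
    return -math.log(prob_target)

# Toy vocabulary of 4 tokens; the model scores the true token highest,
# so the loss is small.
loss = next_token_loss([2.0, 0.1, -1.0, 0.5], target_id=0)
```

At pre-training scale this same per-token loss is averaged over every position in the dataset, which is why data composition matters so much: the model is rewarded only for predicting whatever text it is shown.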
Related terms
Training
The process of teaching a neural network by feeding it data and adjusting its parameters to minimize prediction errors. Training frontier models now costs $100M+ and takes months on thousands of GPUs.
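The parameter adjustment described here is, at its core, gradient descent: nudge each parameter a small step against its gradient to reduce the loss. A minimal sketch (toy values, not a real training loop):

```python
def sgd_step(params, grads, lr=0.01):
    # One stochastic-gradient-descent update: each parameter moves
    # a small step (scaled by the learning rate) opposite its gradient.
    return [p - lr * g for p, g in zip(params, grads)]

# Two toy parameters with their loss gradients; after the step,
# both have moved in the direction that lowers the loss.
params = sgd_step([0.5, -0.2], [1.0, -2.0], lr=0.1)
```

Frontier training runs repeat this update (in practice with optimizers like Adam) billions of times across thousands of GPUs, which is where the months of wall-clock time go.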
Foundation model
A large, general-purpose model pre-trained on broad data that can be adapted to many downstream tasks. GPT-4, Claude, Gemini, and Llama are all foundation models. The term signals massive upfront investment and wide applicability.
Pre-training data
The massive datasets used to train foundation models during the pre-training phase, typically composed of web crawls, books, academic papers, code repositories, and other text sources. Pre-training data quality and composition directly determine model capabilities.
Scaling laws
Empirical relationships showing that model performance improves predictably as you increase data, compute, and parameters. Scaling laws are why labs are pouring billions into ever-larger training runs.
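These empirical relationships are often written in the Chinchilla form L(N, D) = E + A/N^α + B/D^β, where N is parameter count and D is training tokens. A sketch using the coefficients reported in the Chinchilla paper (Hoffmann et al., 2022), here purely as an illustration of the functional form:

```python
def chinchilla_loss(N, D, E=1.69, A=406.4, B=410.7, alpha=0.34, beta=0.28):
    """Predicted pre-training loss for N parameters and D training tokens.

    Coefficients are the published Chinchilla fits; real labs refit
    these constants for their own data and architecture.
    """
    return E + A / N**alpha + B / D**beta

# Scaling both model size and data 10x lowers the predicted loss,
# approaching the irreducible term E.
small = chinchilla_loss(N=7e9, D=1.4e11)
large = chinchilla_loss(N=70e9, D=1.4e12)
```

The smooth, predictable decrease in loss as N and D grow is exactly why labs keep funding ever-larger runs: the return on the next order of magnitude can be estimated before spending it.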