Test-time compute
- Definition
- The practice of allocating additional compute during inference to improve output quality, rather than relying solely on the capabilities baked in during training. Reasoning models and extended thinking are the primary examples of test-time compute scaling.
- Why it matters
- Test-time compute introduces a second scaling axis for AI capability. The first axis is training compute: bigger models trained on more data. The second is inference compute: more thinking time per problem. This is revolutionary because it means you can improve AI outputs without retraining the model, simply by spending more compute at inference time. For product developers, test-time compute creates a natural price-quality dial: simple questions get fast, cheap responses, while complex questions trigger extended reasoning at higher cost. For the industry, test-time compute scaling could sustain capability improvements even if training scaling laws begin to plateau.
- In practice
- OpenAI's o1 model demonstrated that spending 10-100x more inference compute on chain-of-thought reasoning dramatically improved performance on math, science, and coding tasks. On Codeforces competitive programming, o1 placed in the 89th percentile, compared to roughly the 11th percentile for GPT-4o. Anthropic's Claude extended thinking mode allocates variable compute based on problem complexity. DeepSeek-R1 brought test-time compute scaling to open-weight models. The economics are notable: a reasoning model might spend $0.50 solving a problem that a standard model attempts for $0.01, but if the reasoning model gets the right answer and the standard model does not, the cost is justified for high-value tasks.
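That cost argument is plain expected-value arithmetic. A minimal sketch, assuming illustrative accuracies (30% for the standard model, 85% for the reasoning model) and a $10 value for a correct answer; none of these accuracy figures are benchmarks:

```python
# Back-of-envelope expected-value comparison using the $0.50 vs $0.01
# figures above. Accuracy numbers are illustrative assumptions.
def expected_value(task_value, cost, accuracy):
    """Expected net value of attempting a task once."""
    return task_value * accuracy - cost

task_value = 10.00  # assumed value of a correct answer, in dollars
standard = expected_value(task_value, cost=0.01, accuracy=0.30)
reasoning = expected_value(task_value, cost=0.50, accuracy=0.85)
print(f"standard:  ${standard:.2f}")   # $2.99
print(f"reasoning: ${reasoning:.2f}")  # $8.00
```

The 50x cost difference disappears into rounding error once accuracy on a high-value task enters the calculation; for low-value or easy tasks, the inequality flips and the cheap model wins.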
Related terms
Reasoning model
An AI model specifically designed to perform multi-step reasoning, typically by generating an explicit chain of thought before producing a final answer. Reasoning models trade inference speed and cost for dramatically improved performance on complex problems.
Extended thinking
A model feature where the AI explicitly allocates additional inference compute to reason through complex problems step by step before producing a final answer, with the reasoning process visible to the user or developer.
Chain-of-thought (CoT)
A prompting technique that instructs a model to reason step by step before giving a final answer. CoT dramatically improves accuracy on math, logic, and multi-step problems, and reasoning models are now trained to perform it natively without explicit prompting.
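A minimal CoT prompt wrapper; the exact wording below is one common pattern, not a canonical template:

```python
# Wrap a question in a chain-of-thought instruction. The phrasing is
# one widely used pattern, not the only one that works.
def cot_prompt(question: str) -> str:
    return (
        f"Q: {question}\n"
        "Think step by step, then give the final answer on the last line "
        "prefixed with 'Answer:'.\n"
        "A:"
    )

print(cot_prompt("A train travels 120 km in 1.5 hours. What is its average speed?"))
```

The "Answer:" prefix makes the final answer easy to parse out of the reasoning trace, which matters when the chain of thought runs to hundreds of tokens.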
Inference cost
The expense of running an AI model in production, typically measured per million tokens. Inference costs have dropped 10-100x in the past two years, enabling new business models and use cases.
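Per-request cost follows directly from per-million-token rates. The rates and token counts below are illustrative; the one structural fact worth knowing is that reasoning models typically bill their hidden thinking tokens as output tokens, which is where test-time compute shows up on the invoice.

```python
# Per-request cost at per-million-token pricing. Rates and token
# counts are illustrative, not any provider's actual price list.
def request_cost(input_tokens, output_tokens, in_rate_per_m, out_rate_per_m):
    return input_tokens / 1e6 * in_rate_per_m + output_tokens / 1e6 * out_rate_per_m

# Same question, with and without 20k tokens of (billed-as-output) thinking.
plain = request_cost(1_000, 500, in_rate_per_m=3.00, out_rate_per_m=15.00)
thinking = request_cost(1_000, 500 + 20_000, in_rate_per_m=3.00, out_rate_per_m=15.00)
print(f"plain:    ${plain:.4f}")     # $0.0105
print(f"thinking: ${thinking:.4f}")  # $0.3105
```

This is the mechanism behind the $0.01-versus-$0.50 comparison above: identical rates, roughly 30x the cost, almost entirely from thinking tokens.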