LoRA (Low-Rank Adaptation)
- Definition
- A parameter-efficient fine-tuning technique that injects small trainable matrices into a frozen model. LoRA lets companies customize large models at a fraction of the cost of full fine-tuning.
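The core idea can be shown in a minimal numpy sketch (illustrative only; `LoRALinear` and the rank/alpha values here are hypothetical, not any library's API). The frozen weight W is augmented with a trainable low-rank product B·A, so only A and B are updated during fine-tuning:

```python
import numpy as np

class LoRALinear:
    """Frozen weight W plus a trainable low-rank update B @ A (rank r)."""
    def __init__(self, W, r=8, alpha=16):
        d_out, d_in = W.shape
        self.W = W                                  # frozen pretrained weight
        self.A = np.random.randn(r, d_in) * 0.01    # trainable, small random init
        self.B = np.zeros((d_out, r))               # trainable, zero init: delta starts at 0
        self.scale = alpha / r                      # standard LoRA scaling factor

    def forward(self, x):
        # y = W x + scale * B (A x); only A and B would receive gradients
        return self.W @ x + self.scale * (self.B @ (self.A @ x))
```

Because B starts at zero, the adapted layer initially behaves exactly like the frozen base layer, and training moves it away from that baseline only through the small A and B matrices.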
- Why it matters
- LoRA democratized model customization. Before LoRA, fine-tuning a 70B-parameter model required tens of GPUs and weeks of training. LoRA reduces this to a single GPU and hours by only training 0.1-1% of the model's parameters. The business impact is enormous: companies can now customize frontier models for specific domains, output formats, and brand voices without massive infrastructure investment. LoRA also enables serving multiple custom model versions efficiently: the base model stays in memory and different LoRA adapters are hot-swapped per request. This is how companies can offer personalized AI to thousands of enterprise customers from a single GPU cluster.
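The multi-adapter serving pattern described above can be sketched in a few lines (a hypothetical illustration, not a real serving stack): one base weight stays resident while each client's tiny (B, A) pair is selected per request.

```python
import numpy as np

# One frozen base weight, shared by every client (stays in memory).
BASE_W = np.random.randn(16, 32)

# Per-client adapters: each is just a small (B, A) pair, rank 4 here.
adapters = {
    "client_a": (np.random.randn(16, 4) * 0.01, np.random.randn(4, 32) * 0.01),
    "client_b": (np.random.randn(16, 4) * 0.01, np.random.randn(4, 32) * 0.01),
}

def serve(x, client):
    """Apply the shared base plus the requested client's low-rank delta."""
    B, A = adapters[client]           # "hot-swapping" is just a lookup
    return BASE_W @ x + B @ (A @ x)
```

The design point is that the expensive object (the base model) is loaded once, while switching customers touches only kilobytes-to-megabytes of adapter weights.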
- In practice
- The LoRA paper from Microsoft Research (Hu et al., 2021) showed that low-rank adaptation matched full fine-tuning quality on many tasks with up to 10,000x fewer trainable parameters. QLoRA, introduced by Dettmers et al. in 2023, combined LoRA with 4-bit quantization, enabling fine-tuning of a 65B model on a single 48GB GPU. Together AI and Anyscale offer managed LoRA fine-tuning services. In production, companies maintain libraries of LoRA adapters for different clients or use cases, all sharing the same base model. A single H100 can serve the base model with hundreds of LoRA adapters, switching between them in milliseconds.
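The parameter savings are easy to verify with back-of-the-envelope arithmetic (the layer dimensions below are illustrative; the paper's 10,000x figure applies at full GPT-3 scale, where only a subset of layers gets adapters):

```python
# One 4096x4096 attention projection, adapted at rank 8.
d_in, d_out, r = 4096, 4096, 8

full_params = d_in * d_out          # 16,777,216 weights if fully fine-tuned
lora_params = r * (d_in + d_out)    # 65,536 weights in the A and B matrices

print(full_params // lora_params)   # 256x fewer trainable params for this layer
```

The ratio grows with layer width and shrinks with rank, which is why small ranks (4-64) on large models yield such dramatic savings.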
Related terms
Fine-tuning
The process of continuing to train a pre-trained model on a smaller, task-specific dataset. Fine-tuning customizes model behavior for specific domains or formats and is a key part of most enterprise AI deployments.
Quantization
Reducing the numerical precision of a model's weights (e.g., from 32-bit to 4-bit) to shrink its memory footprint and speed up inference. Quantization makes it possible to run large models on consumer hardware.
Parameter
A learnable value inside a neural network that gets adjusted during training. Model size is measured in parameters (e.g., 70B, 405B), which roughly correlates with capability and cost.
Distillation
The process of training a smaller, cheaper model to mimic the behavior of a larger, more capable one. Distillation is how companies ship AI to edge devices and reduce inference costs without sacrificing too much quality.