Encoder-decoder
- Definition
- A neural network architecture in which an encoder compresses the input into an intermediate representation and a decoder generates the output from it, typically attending to the encoder's output via cross-attention. T5 and BART use this pattern, in contrast with decoder-only models like GPT.
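The split described above can be sketched with a toy model in plain NumPy (illustrative only, not a real transformer; all names and parameters here are made up): the encoder maps the input sequence to a context representation, and the decoder then generates output tokens one at a time conditioned on that context.

```python
import numpy as np

rng = np.random.default_rng(0)

VOCAB = 16   # toy vocabulary size
D = 8        # embedding / hidden dimension

# Toy parameters (random; a real model learns these during training).
embed = rng.normal(size=(VOCAB, D))
W_out = rng.normal(size=(D, VOCAB))

def encode(input_ids):
    """Encoder: compress the input sequence into one context vector
    (mean-pooled embeddings; a real encoder uses self-attention)."""
    return embed[input_ids].mean(axis=0)

def decode(context, max_len=5, bos=0):
    """Decoder: generate output tokens autoregressively, each step
    conditioned on the encoder context and the previous token."""
    out, prev = [], bos
    for _ in range(max_len):
        h = context + embed[prev]      # combine context with last token
        logits = h @ W_out             # project hidden state to vocabulary
        prev = int(np.argmax(logits))  # greedy choice of next token
        out.append(prev)
    return out

src = [3, 7, 2]      # hypothetical input token ids
ctx = encode(src)    # encoder output: a single vector of shape (D,)
print(decode(ctx))   # decoder output: a list of 5 generated token ids
```

The key structural point the sketch preserves: the input is read once into a representation, and generation happens only on the decoder side, one token at a time.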
- Why it matters
- Understanding encoder-decoder versus decoder-only architectures helps you choose the right model for your task. Encoder-decoder models excel at tasks where you transform one sequence into another: translation, summarization, code transpilation. Decoder-only models (GPT, Claude, Llama) excel at open-ended generation and instruction following. The industry has largely consolidated around decoder-only for general-purpose LLMs, but encoder-decoder models remain superior for specific tasks and are more parameter-efficient for translation and structured output. For specialized production systems, knowing which architecture fits your task can save significant compute.
- In practice
- Google's T5 and FLAN-T5 remain popular encoder-decoder models for structured NLP tasks. Meta's NLLB (No Language Left Behind) uses an encoder-decoder architecture for translation across 200 languages. Whisper, OpenAI's speech recognition model, uses an encoder-decoder design where the encoder processes audio and the decoder generates text. However, the trend is clearly toward decoder-only: Claude, GPT-4, Gemini, and Llama are all decoder-only. The encoder-decoder architecture survives primarily in specialized applications where its efficiency advantages outweigh the benefits of using a general-purpose decoder-only model.
Related terms
Transformer
The neural network architecture behind virtually all modern language and multi-modal models. Introduced in Google's 2017 'Attention Is All You Need' paper, transformers use self-attention to process sequences in parallel.
Autoregressive model
A model that generates output one token at a time, with each new token conditioned on all previous tokens. GPT, Claude, and Gemini are all autoregressive, which is why they stream responses word by word.
LLM (Large Language Model)
A neural network trained on massive text corpora to predict and generate language. LLMs like GPT-4, Claude, and Gemini are the foundation of the current AI wave, powering chatbots, coding tools, and enterprise automation.