Question 1

What is Jailbreak?

Accepted Answer

A technique for bypassing an AI model's safety guardrails to elicit outputs the model was trained to refuse, such as harmful instructions, restricted content, or system prompt leaks.

Question 2

Why does Jailbreak matter for business?

Accepted Answer

Jailbreaks expose the fragility of current safety mechanisms. If a motivated user can bypass your guardrails with a cleverly worded prompt, your safety measures are a speed bump, not a wall. This matters for product liability, regulatory compliance, and brand reputation. The jailbreak arms race is ongoing: researchers discover new bypass techniques, labs patch them, and new bypasses emerge. For companies deploying AI, the implication is that you cannot rely solely on the model provider's safety training; you need application-level guardrails, output monitoring, and incident response plans. Assuming your model is jailbreak-proof is the most dangerous assumption in production AI.

Jailbreak

Related terms

Know the terms. Know the moves.