EfficientOpenAI

GPT-4o mini

Context

128K tokens

Pricing

$0.15/M input, $0.60/M output

Modalities

text, image, code

Released

Jul 2024

Overview: OpenAI's most cost-effective model, offering strong performance across text and vision tasks at a fraction of GPT-4o's cost. GPT-4o mini is optimized for high-volume, low-latency applications.
Why it matters: At $0.15/M input tokens, GPT-4o mini demolished the price floor for 'good' AI. It is smart enough for most classification, extraction, and summarization tasks while being cheap enough to call on every single user action. For product teams, this is the model that makes AI features economically viable in freemium products and high-volume workflows. Its quality-to-cost ratio forced every competitor to release or accelerate their own efficient tier. If your use case does not require frontier reasoning, GPT-4o mini is often the pragmatic default.

Key strengths