LLM model

GPT-5 mini

Balanced reasoning at a fraction of GPT-5's price.

Illustrative specs. Context window, modalities, output cost, and EU availability for GPT-5 mini are representative, last verified 1 Jun 2026. Verify against the provider before committing.

GPT-5 mini at a glance

The default step-down from GPT-5 when full frontier quality isn't required.

Hosting: Foundry (Azure)
Context window: 400,000 tokens
Modalities: text and vision input
Output cost: $2 / 1M tokens

Best for

  • Mid-complexity reasoning at scale
  • Cost-aware production workloads

Is it the right model?

Match it against your requirements.

The selection wizard ranks GPT-5 mini against every other model for your latency, context, modality, cost, and residency needs — and shows where it wins and where something else fits better.

Open the LLM Selection Wizard

Frequently asked questions

GPT-5 mini specs and cost.

What is GPT-5 mini best for? Balanced reasoning at a fraction of GPT-5's price. It fits mid-complexity reasoning at scale and cost-aware production workloads.
What context window and modalities does GPT-5 mini support? GPT-5 mini handles up to 400,000 tokens of context and accepts text and vision input. It runs on Foundry (Azure).
How much does GPT-5 mini cost? Around $2 per 1M output tokens (illustrative, verified 1 Jun 2026). Output tokens usually dominate the bill, but verify input and cached pricing against the provider before budgeting.
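As a rough budgeting aid, the output-cost figure above can be turned into a quick estimate. The sketch below is a minimal illustration, not a billing tool: it uses only the illustrative $2 per 1M output tokens quoted on this page, ignores input and cached-token pricing (not listed here), and the workload numbers (requests per day, average output length) are hypothetical placeholders to replace with your own.

```python
# Illustrative output price from this page: $2 per 1M output tokens.
# Input and cached-token pricing are NOT included; check the provider.
OUTPUT_PRICE_PER_MILLION_USD = 2.00


def estimate_output_cost(output_tokens: int,
                         price_per_million: float = OUTPUT_PRICE_PER_MILLION_USD) -> float:
    """Return the estimated cost in USD for a number of output tokens."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return output_tokens / 1_000_000 * price_per_million


# Hypothetical workload, for illustration only.
requests_per_day = 50_000
avg_output_tokens_per_request = 400

daily_output_tokens = requests_per_day * avg_output_tokens_per_request
daily_cost = estimate_output_cost(daily_output_tokens)
monthly_cost = daily_cost * 30  # simple 30-day month assumption

print(f"Daily output tokens: {daily_output_tokens:,}")
print(f"Estimated daily output cost: ${daily_cost:,.2f}")
print(f"Estimated 30-day output cost: ${monthly_cost:,.2f}")
```

With these placeholder numbers the workload produces 20 million output tokens a day, which comes to about $40 a day at the illustrative rate. Treat the result as a lower bound, since input tokens are billed separately.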