GPT-5 mini

Balanced reasoning at a fraction of GPT-5's price.

Illustrative specs. Context window, modalities, output cost, and EU availability for GPT-5 mini are representative, last verified 1 Jun 2026. Verify against the provider before committing.

Hosting Foundry Foundry (Azure)

Context window 400,000 tokens

Modalities text, vision inputs

Output cost $2 / 1M tokens

Best for

Mid-complexity reasoning at scale
Cost-aware production workloads

The selection wizard ranks GPT-5 mini against every other model for your latency, context, modality, cost, and residency needs — and shows where it wins and where something else fits better.

Open the LLM Selection Wizard

What is GPT-5 mini best for? Balanced reasoning at a fraction of GPT-5's price. It fits Mid-complexity reasoning at scale, Cost-aware production workloads.

What context window and modalities does GPT-5 mini support? GPT-5 mini handles up to 400,000 tokens of context and supports text, vision input. It runs on Foundry (Azure).

How much does GPT-5 mini cost? Around $2 per 1M output tokens (illustrative, verified 1 Jun 2026). Output tokens usually dominate the bill — verify input and cached pricing against the provider before budgeting.

Suggest improvement

GPT-5 mini

GPT-5 mini at a glance

Best for

Is it the right model?

Frequently asked questions