Low-cost small model for high-volume, latency-sensitive tasks.
Same context family as GPT-4o at a fraction of the token cost. Reach for it first on anything where a smaller model is good enough.
Run your own token counts and volume.
The calculator opens pre-set to GPT-4o mini. Enter your input/output token sizes and monthly requests to see pay-as-you-go cost and the PTU break-even for your workload.
Open the calculator with GPT-4o miniGPT-4o mini cost on Azure OpenAI.