Claude Haiku

Fast small Claude for latency-sensitive tasks.

Illustrative specs. Context window, modalities, output cost, and EU availability for Claude Haiku are representative, last verified 1 Jun 2026. Verify against the provider before committing.

Hosting Bedrock Bedrock (AWS)

Context window 200,000 tokens

Modalities text, vision inputs

Output cost $4 / 1M tokens

Best for

Realtime assistants
High-volume classification

The selection wizard ranks Claude Haiku against every other model for your latency, context, modality, cost, and residency needs — and shows where it wins and where something else fits better.

Open the LLM Selection Wizard

What is Claude Haiku best for? Fast small Claude for latency-sensitive tasks. It fits Realtime assistants, High-volume classification.

What context window and modalities does Claude Haiku support? Claude Haiku handles up to 200,000 tokens of context and supports text, vision input. It runs on Bedrock (AWS).

How much does Claude Haiku cost? Around $4 per 1M output tokens (illustrative, verified 1 Jun 2026). Output tokens usually dominate the bill — verify input and cached pricing against the provider before budgeting.

Suggest improvement

Claude Haiku

Claude Haiku at a glance

Best for

Is it the right model?

Frequently asked questions