Claude 3 Haiku
Anthropic Generally Available
Claude 3 Haiku is Anthropic's fastest and most compact model for near-instant responsiveness. Quick and accurate targeted performance. See the launch announcement and benchmark results [here](https://www.anthropic.com/news/claude-3-haiku) #multimodal
Context
200K
tokens
Input
$0.25
per MTok
Output
$1.25
per MTok
About
Claude 3 Haiku is Anthropic's fastest and most compact model for
near-instant responsiveness. Quick and accurate targeted performance.
See the launch announcement and benchmark results [here](https://www.anthropic.com/news/claude-3-haiku)
#multimodal
Modalities
Input
Text Vision
Output
Text
Code Examples
curl https://api.anthropic.com/v1/messages \
-H "x-api-key: $ANTHROPIC_API_KEY" \
-H "anthropic-version: 2023-06-01" \
-H "Content-Type: application/json" \
-d '{
"model": "claude-3-haiku",
"max_tokens": 1024,
"messages": [
{ "role": "user", "content": "Explain quantum entanglement in one sentence." }
]
}' API Parameters
| Name | Type | Description |
|---|---|---|
max_tokens deprecated | integer | Deprecated. Use max_completion_tokens. |
stop | array | Up to 4 sequences where the API will stop generating tokens. |
temperature | number | Sampling temperature; higher values produce more random output. 0 is deterministic. |
tool_choice | one of | Controls which (if any) tool is called: "none", "auto", "required", or a specific tool. |
tools | array | List of tools (functions) the model may call. |
top_k | integer | Limit sampling to the top-k most likely tokens at each step. |
top_p | number | Nucleus sampling: consider only tokens whose cumulative probability ≥ top_p. |
Standard OpenAI-compatible parameters. Consult the provider docs for model-specific behaviour.