GPT-4.1
OpenAI Generally Available
GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and...
Context
1M
tokens
Input
$2.00
per MTok
Output
$8.00
per MTok
About
GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and...
Modalities
Input
Vision Text file
Output
Text
Code Examples
curl https://api.openai.com/v1/chat/completions \
-H "Authorization: Bearer $OPENAI_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "gpt-4.1",
"messages": [
{ "role": "user", "content": "Explain quantum entanglement in one sentence." }
]
}' API Parameters
| Name | Type | Description |
|---|---|---|
max_completion_tokens | integer | Maximum number of tokens the model may generate in the response. |
max_tokens deprecated | integer | Deprecated. Use max_completion_tokens. |
response_format | one of | Constrain output to a JSON schema or an enum (structured outputs). |
seed | integer | Deterministic seed for sampling. Same seed + same prompt produces identical output. |
structured_outputs | boolean | Enable JSON-schema-constrained output. |
temperature | number | Sampling temperature; higher values produce more random output. 0 is deterministic. |
tool_choice | one of | Controls which (if any) tool is called: "none", "auto", "required", or a specific tool. |
tools | array | List of tools (functions) the model may call. |
top_p | number | Nucleus sampling: consider only tokens whose cumulative probability ≥ top_p. |
Standard OpenAI-compatible parameters. Consult the provider docs for model-specific behaviour.