Gemini 3.1 Pro

Other Generally Available

Higher-end version for complex workflows, stronger reasoning, and demanding real-world tasks.

Context
tokens
Input
per MTok
Output
per MTok

About

Higher-end version for complex workflows, stronger reasoning, and demanding real-world tasks.

API Parameters

Name Type Description
frequency_penalty number Penalize tokens by their frequency so far. Positive values reduce repetition.
max_completion_tokens integer Maximum number of tokens the model may generate in the response.
presence_penalty number Penalize tokens that have appeared at all so far. Positive values encourage new topics.
response_format one of Constrain output to a JSON schema or an enum (structured outputs).
seed integer Deterministic seed for sampling. Same seed + same prompt produces identical output.
stop array Up to 4 sequences where the API will stop generating tokens.
stream boolean Stream partial responses as Server-Sent Events.
temperature number Sampling temperature; higher values produce more random output. 0 is deterministic.
tool_choice one of Controls which (if any) tool is called: "none", "auto", "required", or a specific tool.
tools array List of tools (functions) the model may call.
top_p number Nucleus sampling: consider only tokens whose cumulative probability ≥ top_p.

Standard OpenAI-compatible parameters. Consult the provider docs for model-specific behaviour.