Compare LLM Models

Side-by-side comparison of 1 model. Best values highlighted.

Model ID Qwen/Qwen2.5-72B-Instruct
Status ga
Knowledge cutoff September 2024
Context window 131K tokens
Max output 8K tokens
Parameters 72B
Open weights Yes
Pricing (per MTok)
Input $0.40
Output $1.20
Cache read
Cache write
Free tier No
Capabilities
Tool Use
Streaming
Prompt Caching
Batch API
Extended Thinking
Structured Outputs
Multi-turn Tool Calling
Agentic Workload Ready
Parallel Tool Calls
Vision Input
Audio Input
Audio Output
Modalities
Input text, code
Output text, code
Last updated