Compare LLM Models

Side-by-side comparison of 1 model.

Model ID meta-llama/Llama-4-Scout-17B-16E
Status GA (generally available)
Knowledge cutoff March 2025
Context window 10M tokens
Max output 8K tokens
Parameters 109B total (17B active)
Open weights Yes
Pricing (per MTok, i.e. per million tokens)
Input $0.08
Output $0.30
Cache read
Cache write
Free tier No
Capabilities
Tool Use
Streaming
Prompt Caching
Batch API
Extended Thinking
Structured Outputs
Multi-turn Tool Calling
Agentic Workload Ready
Parallel Tool Calls
Vision Input
Audio Input
Audio Output
Modalities
Input text, vision, code
Output text, code
Last updated May 9, 2026
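
The per-MTok prices listed above can be turned into a rough per-request cost estimate. The following is a minimal sketch, assuming the listed input and output prices apply uniformly to every token and ignoring cache pricing (no cache prices are given above). The helper name `estimate_cost_usd` and the constant names are hypothetical, chosen only for illustration.

```python
from decimal import Decimal

# Prices copied from the card above, in dollars per million tokens (MTok).
INPUT_PRICE_PER_MTOK = Decimal("0.08")
OUTPUT_PRICE_PER_MTOK = Decimal("0.30")
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request (hypothetical helper, not a provider API)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Scale each token count to millions of tokens, then apply the listed price.
    input_cost = INPUT_PRICE_PER_MTOK * input_tokens / TOKENS_PER_MTOK
    output_cost = OUTPUT_PRICE_PER_MTOK * output_tokens / TOKENS_PER_MTOK
    return input_cost + output_cost


if __name__ == "__main__":
    # Example: a 1M-token prompt with an 8,000-token completion.
    print(estimate_cost_usd(1_000_000, 8_000))
```

Decimal arithmetic is used here so that small per-token amounts do not pick up binary floating-point rounding noise. Actual billing may differ by provider.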