Compare LLM Models

Side-by-side comparison of 1 model. Best values highlighted.

Model ID @cf/meta/llama-2-7b-chat-fp16
Status ga
Knowledge cutoff
Context window 4K tokens
Max output
Parameters
Open weights No
Pricing (per MTok)
Input $0.56
Output $6.67
Cache read
Cache write
Free tier No
Capabilities
Tool Use
Streaming
Prompt Caching
Batch API
Extended Thinking
Structured Outputs
Multi-turn Tool Calling
Agentic Workload Ready
Parallel Tool Calls
Vision Input
Audio Input
Audio Output
Modalities
Input
Output
Last updated May 9, 2026