Compare LLM Models

Side-by-side comparison of 1 model.

Model ID @cf/ibm-granite/granite-4.0-h-micro
Status GA (generally available)
Knowledge cutoff
Context window 131K tokens
Max output
Parameters
Open weights No
Pricing (USD per MTok, i.e. per million tokens)
Input $0.017
Output $0.11
Cache read
Cache write
Free tier No
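
To make the per-MTok prices concrete, here is a minimal sketch of a cost estimate for a single request. It assumes the listed input and output prices are in USD per one million tokens and that no cache pricing applies, since the cache read and cache write fields are blank in this listing. The function name `estimate_cost_usd` and the constants are hypothetical, introduced only for illustration; they are not part of any provider API.

```python
from decimal import Decimal

# Prices copied from the listing above, assumed to be USD per million tokens.
INPUT_USD_PER_MTOK = Decimal("0.017")
OUTPUT_USD_PER_MTOK = Decimal("0.11")
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the cost of one request in USD (hypothetical helper).

    Cache read/write charges are ignored because the listing gives no
    values for them.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_USD_PER_MTOK
        + Decimal(output_tokens) * OUTPUT_USD_PER_MTOK
    )
    return total / TOKENS_PER_MTOK


if __name__ == "__main__":
    # 100,000 input tokens and 2,000 output tokens.
    print(estimate_cost_usd(100_000, 2_000))
```

Decimal is used instead of float so that very small per-request amounts are not distorted by binary rounding. Under the stated prices, output tokens cost roughly 6.5 times as much as input tokens, so long completions tend to dominate the bill even when prompts are large.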
Capabilities
Tool Use
Streaming
Prompt Caching
Batch API
Extended Thinking
Structured Outputs
Multi-turn Tool Calling
Agentic Workload Ready
Parallel Tool Calls
Vision Input
Audio Input
Audio Output
Modalities
Input
Output
Last updated May 9, 2026