L
Llama 3B
llama-3b
Llama 3.2 3B Instruct. Compact and fast model for simple tasks, quick queries, and lightweight automation.
Show more ↓
Modalities
Text→Text
In / Out Price
$0.05 / $0.15 per 1M
Context
16K
Size Tier
Nano (3B)
Providers
This model is hosted by the Unces infrastructure. We forward every request directly to our self-hosted GPU cluster endpoints.
| Provider | Input /M | Output /M | Cache Read /M | Latency | Throughput | Uptime |
|---|---|---|---|---|---|---|
| Unces Self-Host | $0.05 | $0.15 | $0.00 | 0.14s | 48 tps | 100.0% |
Effective Pricing
The chart below shows the average price customers pay. Standard rate structures are optimized for flat usage caps.
Weighted Avg Input Price$0.05
Weighted Avg Output Price$0.15