Models/Llama 3B

Llama 3B

llama-3b

Llama 3.2 3B Instruct. Compact and fast model for simple tasks, quick queries, and lightweight automation.

Show more ↓

Modalities

Text→Text

In / Out Price

$0.05 / $0.15 per 1M

Context

16K

Size Tier

Nano (3B)

Providers

This model is hosted by the Unces infrastructure. We forward every request directly to our self-hosted GPU cluster endpoints.

Provider	Input /M	Output /M	Cache Read /M	Latency	Throughput	Uptime
Unces Self-Host	$0.05	$0.15	$0.00	0.14s	48 tps	100.0%

The chart below shows the average price customers pay. Standard rate structures are optimized for flat usage caps.

Weighted Avg Input Price$0.05

Weighted Avg Output Price$0.15