Models/Flash 30B

Flash 30B

flash-30b

GLM-4.7 based MoE flash model with only 2B active parameters. Optimized for speed with specialized imatrix quantization.

Show more ↓

Modalities

Text→Text

In / Out Price

$2.00 / $6.00 per 1M

Context

16K

Size Tier

Large (30B)

Providers

This model is hosted by the Unces infrastructure. We forward every request directly to our self-hosted GPU cluster endpoints.

Provider	Input /M	Output /M	Cache Read /M	Latency	Throughput	Uptime
Unces Self-Host	$2.00	$6.00	$0.00	0.14s	48 tps	100.0%

The chart below shows the average price customers pay. Standard rate structures are optimized for flat usage caps.

Weighted Avg Input Price$2.00

Weighted Avg Output Price$6.00