Flash 30B
flash-30b
A GLM-4.7-based MoE flash model with only 2B active parameters. Optimized for speed, with specialized imatrix quantization.
Modalities
Text→Text
In / Out Price
$2.00 / $6.00 per 1M tokens
Context
16K
Size Tier
Large (30B)
Providers
This model is hosted on Unces infrastructure. Every request is forwarded directly to our self-hosted GPU cluster endpoints.
| Provider | Input / 1M tokens | Output / 1M tokens | Cache Read / 1M tokens | Latency | Throughput | Uptime |
|---|---|---|---|---|---|---|
| Unces Self-Host | $2.00 | $6.00 | $0.00 | 0.14s | 48 tps | 100.0% |
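To make the table figures concrete, here is a minimal sketch that estimates the cost and rough wall-clock time of a single request. The prices, latency, and throughput are copied from the table; the function names are hypothetical, and the sketch assumes that latency means time to first token and that throughput means output tokens per second, neither of which this page states.

```python
# Figures copied from the provider table above.
INPUT_PRICE_PER_M = 2.00   # USD per 1M input tokens
OUTPUT_PRICE_PER_M = 6.00  # USD per 1M output tokens
LATENCY_S = 0.14           # ASSUMPTION: time to first token, in seconds
THROUGHPUT_TPS = 48        # ASSUMPTION: output tokens generated per second


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimated request cost in USD, ignoring cache reads (listed at $0.00)."""
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000


def estimate_seconds(output_tokens: int) -> float:
    """Rough response time: initial latency plus generation time."""
    return LATENCY_S + output_tokens / THROUGHPUT_TPS


if __name__ == "__main__":
    cost = estimate_cost(input_tokens=4_000, output_tokens=500)
    seconds = estimate_seconds(output_tokens=500)
    print(f"cost: ${cost:.4f}, time: {seconds:.2f}s")
```

Under these assumptions, output length dominates both cost and response time, since output tokens cost three times as much as input tokens and are generated sequentially.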
Effective Pricing
The figures below show the weighted average price customers pay per 1M tokens. Standard rate structures are optimized for flat usage caps.
Weighted Avg Input Price: $2.00
Weighted Avg Output Price: $6.00
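The weighted averages above match the list prices, which is what a flat rate from a single provider would produce. The sketch below illustrates that arithmetic. It assumes the average is weighted by token volume, which this page does not specify; the function name and the token volumes are hypothetical.

```python
def weighted_avg_price(usage: list[tuple[int, float]]) -> float:
    """Token-weighted average price per 1M tokens.

    usage: (tokens_billed, price_per_million) pairs, one per billing bucket.
    ASSUMPTION: weighting by token volume; the page does not state the method.
    """
    total_tokens = sum(tokens for tokens, _ in usage)
    if total_tokens == 0:
        raise ValueError("no tokens billed")
    return sum(tokens * price for tokens, price in usage) / total_tokens


# Hypothetical volumes; every bucket is billed at the flat listed rate.
input_usage = [(1_200_000, 2.00), (300_000, 2.00)]
output_usage = [(400_000, 6.00), (50_000, 6.00)]

if __name__ == "__main__":
    print(f"input:  ${weighted_avg_price(input_usage):.2f}")
    print(f"output: ${weighted_avg_price(output_usage):.2f}")
```

Because every bucket carries the same rate, the weights cancel and the average equals the list price regardless of volume.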