
Flash 30B

flash-30b

A GLM-4.7-based mixture-of-experts (MoE) flash model with only 2B active parameters. Optimized for speed using specialized importance-matrix (imatrix) quantization.

Modalities
Text → Text
In / Out Price
$2.00 / $6.00 per 1M tokens
Context
16K
Size Tier
Large (30B)
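
As a rough illustration of how the listed rates translate into per-request cost, here is a minimal sketch. It assumes "per 1M" means per one million tokens and that input and output tokens are billed separately at the listed rates. The function name `estimate_cost` and the token counts are hypothetical, chosen only for the example.

```python
# Rates copied from the listing, in USD per 1M tokens.
# Assumption: "per 1M" refers to tokens, billed separately for input and output.
INPUT_PRICE_PER_M = 2.00
OUTPUT_PRICE_PER_M = 6.00


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD for one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 3,000-token prompt with a 500-token reply.
    print(f"${estimate_cost(3_000, 500):.4f}")
```

With these example numbers the input side contributes $0.006 and the output side $0.003, so output tokens cost three times as much per token as input tokens.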

Providers

This model is hosted on Unces infrastructure. Every request is forwarded directly to our self-hosted GPU cluster endpoints.

Provider         Input /M   Output /M   Cache Read /M   Latency   Throughput   Uptime
Unces Self-Host  $2.00      $6.00       $0.00           0.14s     48 tps       100.0%
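
The latency and throughput figures can be combined into a back-of-the-envelope response-time estimate. The sketch below assumes that "Latency" is the time to the first token and that "Throughput" is output tokens per second at a steady rate. Neither definition is stated in the listing, and the helper name `estimate_seconds` is hypothetical.

```python
# Figures copied from the provider table.
# Assumptions: latency = time to first token; throughput = steady output tokens/sec.
LATENCY_S = 0.14
THROUGHPUT_TPS = 48.0


def estimate_seconds(output_tokens: int) -> float:
    """Rough wall-clock estimate for generating `output_tokens` (hypothetical helper)."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return LATENCY_S + output_tokens / THROUGHPUT_TPS


if __name__ == "__main__":
    # Example: a 480-token reply.
    print(f"{estimate_seconds(480):.2f} s")
```

Under these assumptions generation time dominates for anything beyond a few tokens, so the reply length matters far more than the initial latency.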

Effective Pricing

The figures below show the weighted average price customers pay per 1M tokens. Standard rate structures are optimized for flat usage caps.

Weighted Avg Input Price: $2.00
Weighted Avg Output Price: $6.00