Q
Qwepus 35B A3B
qwepus-35b-a3b
Flagship hybrid MoE+SSM model. 35B total parameters with only 3.6B active per token. Native 262K context with reasoning and thinking capabilities.
Show more ↓
Modalities
Text→Multimodal
In / Out Price
$3.00 / $9.00 per 1M
Context
131K
Size Tier
Flagship (35B)
Providers
This model is hosted by the Unces infrastructure. We forward every request directly to our self-hosted GPU cluster endpoints.
| Provider | Input /M | Output /M | Cache Read /M | Latency | Throughput | Uptime |
|---|---|---|---|---|---|---|
| Unces Self-Host | $3.00 | $9.00 | $0.00 | 0.14s | 48 tps | 100.0% |
Effective Pricing
The chart below shows the average price customers pay. Standard rate structures are optimized for flat usage caps.
Weighted Avg Input Price$3.00
Weighted Avg Output Price$9.00