Better output from the same model. Fused computation, adaptive precision, surgical expert loading. 305 KB, 19 backends, zero dependencies. https://inference-x.com
427 B
427 B
| name | about | title | labels |
|---|---|---|---|
| Hardware Report | Share benchmark results or report hardware compatibility | [Hardware] | hardware |
Hardware
- CPU:
- GPU (if applicable):
- RAM:
- OS:
Benchmark Results
| Model | Params | Quant | tok/s | Prefill |
|---|---|---|---|---|