Better output from the same model. Fused computation, adaptive precision, surgical expert loading. 305 KB, 19 backends, zero dependencies. https://inference-x.com
---
name: Hardware Report
about: Share benchmark results or report hardware compatibility
title: '[Hardware] '
labels: hardware
---

## Hardware
- **CPU:**
- **GPU (if applicable):**
- **RAM:**
- **OS:**

## Benchmark Results
| Model | Params | Quant | tok/s | Prefill |
|-------|--------|-------|-------|---------|
| | | | | |

## Notes
<!-- Any issues, optimizations, or observations -->