inference-x/benchmarks/results.csv
Salka Elmadani ec36668cf5 Inference-X v1.0 — Universal AI Inference Engine
Better output from the same model. Fused computation, adaptive precision,
surgical expert loading. 305 KB, 19 backends, zero dependencies.

https://inference-x.com
2026-02-23 07:10:47 +00:00

713 B

model,params,quant,hardware,time_ms,tok_s,tokens,quality
SmolLM2-135M,135M,Q8_0,EPYC-16T-64GB,643,12.44,8,GARB
Llama-3.2-1B,1B,Q4_K_M,EPYC-16T-64GB,2702,2.96,8,OK
Qwen2.5-3B,3B,Q4_K_M,EPYC-16T-64GB,5499,1.45,8,PASS
Llama-3.2-3B,3B,Q4_K_M,EPYC-16T-64GB,5336,1.49,8,OK
Phi-3.5-mini,3.8B,Q4_K_M,EPYC-16T-64GB,5700,0,0,CRASH
Mistral-7B,7B,Q4_K_M,EPYC-16T-64GB,300000,0,0,TIMEOUT
Qwen2.5-7B,7B,Q4_K_M,EPYC-16T-64GB,300000,0,0,TIMEOUT
DeepSeek-R1-7B,7B,Q4_K_M,EPYC-16T-64GB,300000,0,0,TIMEOUT
Llama-3.1-8B,8B,Q4_K_M,EPYC-16T-64GB,300000,0,0,TIMEOUT
Gemma-2-9B,9B,Q4_K_M,EPYC-16T-64GB,300000,0,0,TIMEOUT
DeepSeek-R1-14B,14B,Q4_K_M,EPYC-16T-64GB,300000,0,0,TIMEOUT
Qwen2.5-14B,14B,Q4_K_M,EPYC-16T-64GB,300000,0,0,TIMEOUT
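
In the rows that produced tokens, tok_s appears to equal tokens divided by time_ms expressed in seconds (for example 8 / 0.643 s ≈ 12.44). The sketch below recomputes that value for a few rows copied from the table above and flags any row where the recorded figure differs. The comma-separated field boundaries, the helper names `throughput` and `check`, and the 0.01 tolerance are assumptions made for illustration, not part of the benchmark tooling.

```python
import csv
import io

# Sample rows copied from the table above. The field boundaries are an
# assumption based on the eight header columns.
SAMPLE = """\
model,params,quant,hardware,time_ms,tok_s,tokens,quality
SmolLM2-135M,135M,Q8_0,EPYC-16T-64GB,643,12.44,8,GARB
Llama-3.2-1B,1B,Q4_K_M,EPYC-16T-64GB,2702,2.96,8,OK
Qwen2.5-3B,3B,Q4_K_M,EPYC-16T-64GB,5499,1.45,8,PASS
Mistral-7B,7B,Q4_K_M,EPYC-16T-64GB,300000,0,0,TIMEOUT
"""


def throughput(tokens: int, time_ms: int) -> float:
    """Tokens per second from a token count and a wall time in milliseconds."""
    return tokens / (time_ms / 1000.0) if time_ms > 0 else 0.0


def check(csv_text: str, tolerance: float = 0.01) -> list[dict]:
    """Recompute tok_s for each row and compare it with the recorded value."""
    report = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        recomputed = throughput(int(row["tokens"]), int(row["time_ms"]))
        recorded = float(row["tok_s"])
        report.append({
            "model": row["model"],
            "quality": row["quality"],
            "recorded": recorded,
            "recomputed": round(recomputed, 2),
            # Hypothetical tolerance: allows for two-decimal rounding or truncation.
            "consistent": abs(recomputed - recorded) <= tolerance,
        })
    return report


if __name__ == "__main__":
    for entry in check(SAMPLE):
        print(entry)
```

Rows marked CRASH or TIMEOUT record zero tokens, so their recomputed throughput is 0 regardless of the elapsed time.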