IX Tools
Utilities for the Inference-X ecosystem
A collection of tools for working with GGUF models, benchmarking hardware, managing Inference-X deployments, and integrating with common workflows.
Tools
| Tool | Description |
|---|---|
| ix-bench | Hardware benchmark: measures t/s across backends |
| ix-convert | Converts SafeTensors and PyTorch models to GGUF |
| ix-pull | Downloads GGUF models from Hugging Face |
| ix-serve | Production-ready IX server wrapper with auth and logging |
| ix-proxy | Load balancer across multiple IX instances |
| ix-monitor | Dashboard showing GPU usage, t/s, and active connections |
| ix-chat | Terminal chat UI with history and markdown rendering |
| ix-embed | Batch embedding generation tool |
Quick Start
git clone https://git.inference-x.com/inference-x/ix-tools
cd ix-tools && make
# Benchmark your hardware
./ix-bench --all-backends
# Download a model
./ix-pull qwen2.5-7b-instruct-q4_k_m.gguf
# Start production server
./ix-serve --model model.gguf --port 8080 --workers 4
# Monitor running instances
./ix-monitor --host localhost:8080
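
Before handing a downloaded file to `./ix-serve`, it can help to confirm that it really is a GGUF file. The sketch below is an illustration and not part of ix-tools: `is_gguf` is a hypothetical helper that only checks for the four-byte `GGUF` magic at the start of the file, so it catches truncated or mislabeled downloads but does not validate the rest of the model.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with ix-tools).
# Succeeds if the file begins with the 4-byte GGUF magic "GGUF".
is_gguf() {
    # Reject missing paths and non-regular files.
    [ -f "$1" ] || return 1
    # Read the first 4 bytes; drop NUL bytes so the shell does not warn.
    magic=$(dd if="$1" bs=4 count=1 2>/dev/null | tr -d '\0')
    [ "$magic" = "GGUF" ]
}

# Example use with the flags from the Quick Start above:
#   is_gguf model.gguf && ./ix-serve --model model.gguf --port 8080 --workers 4
```

A magic-byte check is cheap and needs no extra tooling, which is why it is used here instead of a full parse of the file header.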
Hardware Benchmark Reference (Feb 2026)
Run ./ix-bench to measure your actual hardware.
Community results: git.inference-x.com/inference-x-community/ix-scout
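
When comparing your own runs against community results, it helps to be explicit about what t/s means. The sketch below assumes t/s is tokens generated divided by wall-clock seconds. `tokens_per_sec` is a hypothetical helper and not part of ix-tools, and it makes no assumption about the output format of `./ix-bench`.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with ix-tools).
# Usage: tokens_per_sec <tokens> <seconds>
# Prints throughput with two decimals; fails if seconds is not positive.
tokens_per_sec() {
    awk -v t="$1" -v s="$2" 'BEGIN {
        if (s <= 0) exit 1      # avoid division by zero or negative time
        printf "%.2f\n", t / s
    }'
}

# Example: 512 tokens generated in 8 seconds.
#   tokens_per_sec 512 8
```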
inference-x.com · MIT License