IX Tools

Utilities for the Inference-X ecosystem

A collection of tools for working with GGUF models, benchmarking hardware, managing Inference-X deployments, and integrating with common workflows.

Tools

Tool	Description
`ix-bench`	Hardware benchmark: measures tokens per second (t/s) across backends
`ix-convert`	Convert SafeTensors or PyTorch models to GGUF
`ix-pull`	Download GGUF models from Hugging Face
`ix-serve`	Production-ready IX server wrapper with authentication and logging
`ix-proxy`	Load balancer across multiple IX instances
`ix-monitor`	Dashboard showing GPU usage, t/s, and active connections
`ix-chat`	Terminal chat UI with history and markdown rendering
`ix-embed`	Batch embedding generation tool

Quick Start

git clone https://git.inference-x.com/inference-x/ix-tools
cd ix-tools && make

# Benchmark your hardware
./ix-bench --all-backends

# Download a model
./ix-pull qwen2.5-7b-instruct-q4_k_m.gguf

# Start production server
./ix-serve --model model.gguf --port 8080 --workers 4

# Monitor running instances
./ix-monitor --host localhost:8080

Hardware Benchmark Reference (Feb 2026)

Run ./ix-bench to measure your actual hardware.
Community results: git.inference-x.com/inference-x-community/ix-scout
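
For readers comparing their own numbers with community results, the t/s figure is simply generated tokens divided by elapsed wall-clock seconds. The sketch below illustrates that arithmetic only. The `tokens_per_second` helper is hypothetical and not part of ix-tools, and it makes no claim about how `ix-bench` measures or reports results internally.

```shell
#!/bin/sh
# Hypothetical helper (not part of ix-tools): computes tokens per second
# from a token count and an elapsed time in seconds.
tokens_per_second() {
    tokens=$1
    seconds=$2
    # awk performs the floating-point division; a non-positive elapsed
    # time is rejected with a non-zero exit status.
    awk -v t="$tokens" -v s="$seconds" 'BEGIN {
        if (s <= 0) { print "elapsed time must be positive" > "/dev/stderr"; exit 1 }
        printf "%.2f\n", t / s
    }'
}

# Example: 512 tokens generated in 16 seconds.
tokens_per_second 512 16
```

Keeping the calculation in a small function makes it easy to apply to timings collected by hand, for example from a stopwatch run of a single prompt.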

inference-x.com · MIT License
