Public roadmap — features, milestones, community voting.
https://inference-x.com
Current Focus (Q1 2026)
- WebGPU backend (browser-native inference)
- LoRA adapter support
- Structured output (JSON schema enforcement)
- Model caching / lazy loading
Upcoming (Q2 2026)
- ONNX Runtime backend
- Speculative decoding (targeting a 2-4× speedup)
- Multi-modal (vision) unification
- ix-scout public map launch
Completed
- 19 hardware backends (CUDA, Metal, Vulkan, ROCm, CPU, and others)
- OpenAI API compatibility
- ARM / RISC-V support
- ECHO multi-agent architecture (research)
- Organ Architecture (model surgery)
How to vote / contribute
Open a new issue or comment on an existing one. Items with the most community interest are prioritized.
Last updated: Feb 2026 · inference-x.com