Better output from the same model. Fused computation, adaptive precision, surgical expert loading. 305 KB, 19 backends, zero dependencies. https://inference-x.com
377 B
377 B
What does this PR do?
Type of change
- Bug fix
- New feature
- Performance improvement
- New backend / hardware support
- Documentation
- Other
Testing
makesucceeds- Tested with at least one model
- Benchmarked (if performance-related)