Neural network surgery — extract, transplant, and recombine GGUF model components. https://inference-x.com

fine-tuning gguf layers model neural-network

Go to file

Inference-X 4be971b42a docs: professional README — Feb 2026		2026-02-25 17:25:22 +00:00
README.md	docs: professional README — Feb 2026	2026-02-25 17:25:22 +00:00

README.md

Organ Architecture

Neural network surgery and checkpoint manipulation for GGUF models

inference-x.com · Community

What is Organ Architecture?

Organ Architecture treats neural networks like biological organisms — with organs (layers, heads, MLP blocks) that can be extracted, analyzed, transplanted, and recombined.

Features

Model inspection — visualize layer shapes, attention patterns, weight distributions
Selective extraction — extract specific layers or attention heads as "organs"
Cross-model transplant — graft organs between compatible GGUF models
Quantization surgery — selectively re-quantize individual layers
Architecture diff — compare two model checkpoints layer-by-layer

Quick Start

git clone https://git.inference-x.com/inference-x/organ-architecture
cd organ-architecture && make

# Inspect a model
./organ inspect model.gguf

# Extract attention layers 16-24
./organ extract model.gguf --layers 16-24 --output head_block.gguf

# Transplant into another model
./organ transplant donor.gguf recipient.gguf --source-layers 16-24 --target-layers 16-24

Supported Operations

Operation	Description
`inspect`	Show model architecture, layer shapes, parameter count
`extract`	Pull specific organs (layers, heads, MLPs)
`transplant`	Merge organs from one model into another
`diff`	Compare two models layer by layer
`requant`	Re-quantize selected layers
`prune`	Remove layers to create smaller variants

Part of the Inference-X ecosystem · See also: organ-store, organ-architect