# BitNet-TRNQ

A ternary-quantized version of [microsoft/bitnet-b1.58-2B-4T-bf16](https://huggingface.co/microsoft/bitnet-b1.58-2B-4T-bf16), packaged for the Trillim DarkNet inference engine.

This model runs entirely on CPU — no GPU required.

## Model Details

| Property | Value |
|---|---|
| Architecture | BitNet (`BitNetForCausalLM`) |
| Parameters | ~2B |
| Hidden size | 2560 |
| Layers | 30 |
| Attention heads | 20 (5 KV heads) |
| Context length | 4096 |
| Quantization | Ternary ({-1, 0, 1}) |
| Source model | microsoft/bitnet-b1.58-2B-4T-bf16 |
| License | MIT |
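The {-1, 0, 1} weight values listed above correspond to BitNet b1.58-style ternary quantization. As a rough illustration of the idea (a sketch of the absmean scheme from the BitNet b1.58 paper — not necessarily the exact rounding or packing Trillim uses on disk):

```python
def ternary_quantize(weights):
    """Quantize a list of float weights to {-1, 0, 1} plus a scale.

    Sketch of BitNet b1.58's absmean scheme: scale by the mean absolute
    weight, round to the nearest integer, clip to [-1, 1]. Illustrative
    only; the real engine works on packed tensors, not Python lists.
    """
    scale = sum(abs(w) for w in weights) / len(weights)
    quantized = [max(-1, min(1, round(w / (scale + 1e-8)))) for w in weights]
    return quantized, scale

# Dequantization is just `q * scale` per weight, which is why ternary
# matmuls reduce to additions and subtractions (no multiplies by q).
q, scale = ternary_quantize([0.9, -0.05, 0.5, -1.2])
```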

## Usage

```shell
pip install trillim
trillim pull Trillim/BitNet-TRNQ
trillim serve Trillim/BitNet-TRNQ
```

This starts an OpenAI-compatible API server at `http://127.0.0.1:8000`.
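Once the server is up, any OpenAI-compatible client can talk to it. A minimal stdlib-only sketch (assumptions: the standard `/v1/chat/completions` route and the repo id as the model name — check `trillim serve` output for the exact values):

```python
import json
import urllib.request

# Assumed request shape for an OpenAI-compatible endpoint; the server
# must already be running via `trillim serve Trillim/BitNet-TRNQ`.
payload = {
    "model": "Trillim/BitNet-TRNQ",   # assumed model id
    "messages": [{"role": "user", "content": "Hello!"}],
    "max_tokens": 64,
}

def chat(payload, base_url="http://127.0.0.1:8000"):
    """POST a chat-completion request and return the parsed JSON reply."""
    req = urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())

# With the server running:
# reply = chat(payload)
# print(reply["choices"][0]["message"]["content"])
```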

For interactive CLI chat:

```shell
trillim chat Trillim/BitNet-TRNQ
```

## What's in this repo

| File | Description |
|---|---|
| `qmodel.tensors` | Ternary-quantized weights in Trillim format |
| `rope.cache` | Precomputed RoPE embeddings |
| `config.json` | Model configuration |
| `tokenizer.json` | Tokenizer |
| `tokenizer_config.json` | Tokenizer configuration |
| `trillim_config.json` | Trillim metadata |
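For intuition, `rope.cache` holds precomputed rotary position embeddings. A sketch of the kind of cos/sin table such a cache typically contains (assuming standard RoPE with base 10000; the actual file layout is Trillim-specific and not documented here):

```python
import math

def rope_cache(seq_len, head_dim, base=10000.0):
    """Precompute RoPE cos/sin tables for all positions.

    Standard rotary embeddings: frequency i is base^(-2i/head_dim),
    and position p gets angle p * freq_i. Returned as two
    seq_len x (head_dim // 2) tables.
    """
    half = head_dim // 2
    inv_freq = [base ** (-2 * i / head_dim) for i in range(half)]
    cos = [[math.cos(p * f) for f in inv_freq] for p in range(seq_len)]
    sin = [[math.sin(p * f) for f in inv_freq] for p in range(seq_len)]
    return cos, sin
```

Precomputing these once (as the repo's cache file does) avoids recomputing trigonometric functions on every forward pass.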

## License

This model is released under the MIT License, following the license of the source model.
