Run a prompt against a near-lossless 5-bit model. Right now.

No signup, no install, no API key. The weights serving this are stored at 5 bits per weight and reconstruct bit-for-bit to the validated artifact recorded in the SHA-256 manifest (the compressed pack, not the original full-precision model) — confirm the bytes match with one uc verify on your own machine.

SHA-256 verifiable 22 PPL-verified (17 dense + 4 MoE + 1 SSM) + 1 ViT cosine-verified Patents pending
5 bpw
storage format
SHA-256
reproducible reconstruction
1.0066×
405B headline PPL ratio
$0
to try this page

Tip: the warm sipsa-qwen3-0.6b is a tiny 0.6B model — keep prompts short and direct (a fact, a one-liner, a haiku). It is here to prove the compressed weights reconstruct exactly, not to win reasoning benchmarks. For larger models, grab a free API key.

Live output (streaming from api.sipsalabs.com)

Click Run to send a prompt to the live compressed model.
Want your own key? Self-serve API access ships with $5 free credits, no card. Use the same model menu (22 PPL-verified + 1 ViT cosine-verified, OpenAI-compatible endpoint at api.sipsalabs.com/v1). → /get-access · /pricing

Sample interaction (no JS required)

prompt → sipsa-qwen3-0.6b

Q. Write a haiku about a model file weighing five bits per weight.

A. Five bits per weight, light —
a thousand layers folded small,
the same words emerge.

verification

Output above produced by the same model file you can download from SipsaLabs/qwen3-0.6b-uc-v3-bpw5. SHA-256 manifest on the HF page; uc verify confirms pack structure + download integrity.