Phase 0 POC: your model, your hardware, five days.
A fixed-scope, fixed-fee engagement that puts a near-lossless 5-bit pack of your model in your hands — with a signed reconstruction audit you can take to a regulator or a model-risk committee. No procurement gauntlet, no scoping calls that fan out into a quarter, no “let’s talk about your stack first.” One model, one pack, one report, five business days.
1.0066×
Hermes-3-405B PPL ratio on a single 32 GB consumer GPU
SHA-256
Reproducible reconstruction · verifiable in seconds with uc verify
5,300+
PyPI installs + HF pack pulls (30 d) · $0 paid acquisition
Three tiers
Anchor Design Partner
$0
For the first two regulated AI deployers willing to be a named case study.
- Same five-day POC delivered.
- You let us publish a one-page case study with your model class and PPL ratio.
- Two slots only. Likely closed before we can update this page.
Phase 0 POC
$5K
For teams who want to evaluate UltraCompress before they commit to a production rollout.
- Five business days, fixed-fee.
- One
.uc pack of your model, run end-to-end through your evaluation harness.
- SHA-256 reproducible reconstruction audit signed by Sipsa Labs.
- Payment via US wire or ACH after a one-page SOW.
Production engagement
Talk to us
Multi-model rollouts, custom architectures, regulated-vertical-specific audit packages.
- Scoped after the Phase 0 POC clears.
- Annual or per-model licensing.
- Includes the reconstruction-audit package required by FDA SaMD, SR 11-7, or DoD ATO programs.
What ships, exactly
- The
.uc pack itself. A 5-bit compressed artifact of the model you nominate. We’re working from your bf16 (or fp16) checkpoint. Storage on the order of one third of the original.
- The SHA-256 manifest. Computed over the reconstructed bf16 weights, recorded on the signed report. Recompute it at deploy time, at runtime, in your CI — equality means the bytes are exactly what we audited.
- The reconstruction tooling. A small Python entrypoint that loads the
.uc pack and returns bf16 weights. Reconstruction is deterministic and hardware-independent — you get the same bytes on an A100, an H100, an RTX 5090, or a CPU.
- The PPL-ratio measurement. A canonical perplexity comparison between the bf16 baseline and the reconstructed pack, on the same calibration set we use for our public benchmark suite (or yours, if you supply one).
- A one-page reconstruction audit report. Signed PDF, suitable for inclusion in an FDA SaMD pre-submission, an SR 11-7 model-risk packet, or a DoD ATO accreditation file. States the manifest, the reconstruction contract, the PPL ratio, and the measured maximum absolute reconstruction difference (which is 0.00e+00 fp32; not “small,” zero).
The five days
Day 1
You send us the model checkpoint (HF repo, S3 bucket, or signed transfer link). We confirm architecture coverage and lock the scope.
Day 2 – 3
Compression run. Round-trip reconstruction. SHA-256 manifest computed. Internal sanity check pass.
Day 4
PPL evaluation on your calibration set (or our canonical FineWeb-Edu / FineWeb default if you don’t provide one). Honest negative-result check.
Day 5
Audit report signed, pack and tooling shipped, reconstruction recipe walked through with your engineering lead on a 30-minute call.
What we need from you
- A bf16 (or fp16) model checkpoint we can pull. We support every transformer architecture in our public catalog end-to-end, and several others under active evaluation. MoE, state-space, and dense decoder-only models are all in scope.
- A calibration / evaluation set, if you have one you want PPL measured against. If you don’t, we use FineWeb-Edu (the same set we use for the public benchmark suite) and disclose that on the report.
- A single engineering point of contact for the kickoff and the close-out call.
- If you’re paying: a US wire or ACH transfer. We do not collect credit card data. We do not ask for an MSA before kickoff; we operate under our public Terms with a one-page POC SOW.
What this is not
- Not a fine-tuning service. We don’t train your model. We compress the artifact you give us, with reproducible reconstruction under a specific contract.
- Not lossless-with-respect-to-the-original. Five-bit quantization is lossy on the original weights; that’s the PPL ratio number on the report. What’s reproducible is the reconstruction — every load reproduces the SHA-256 manifest recorded at pack time, verifiable in seconds with
uc verify. That’s the property regulated buyers actually need.
- Not free-form research. If the model architecture isn’t one we’ve characterized publicly and you want it in five business days, we’ll tell you so on Day 1 and refund.
Start a Phase 0 POC
Two Anchor Design Partner slots remaining at $0 in exchange for case-study rights. Standard Phase 0 is $5K, five business days, your model on your hardware. Payment via US wire / ACH after a one-page SOW — no self-serve checkout, no credit card collection.
Submit the scope form below to request a Phase 0 engagement. We respond within 24 hours with a personalized scheduling link if it’s a fit. Engineers evaluating the tech first should take the free $5 credit path.
Path 1 · Recommended
Request a Phase 0 POC
Procurement-friendly intake. Fill the scope form, hit Send, your mail client opens with the inquiry pre-formatted to founder@sipsalabs.com. We respond within 24 hours with a personalized scheduling link if there’s a fit, and a one-page SOW.
Open the scope form ↓
Founder reviews every submission · 24-hour reply · personalized scheduling link sent after qualification.
Path 2
Try it first — free $5 credit
Engineers usually start here. Free $5 credit + API key — download a verified pack and run uc verify on your own hardware before you talk to sales. Same OpenAI-compatible SDK, three-second signup, no card.
Free $5 credit + API key →
Or read the public source · Healthcare · Defense
Sipsa Labs, Inc. · Delaware C-Corp · Patents pending · Security · Privacy · sipsalabs.com