
Every local model — Llama, Gemma, Qwen, BitNet — generates with full confidence whether the answer is right or completely made up. There’s no internal signal that says “I’m not sure.”
σ-gate adds that signal. It reads the model’s hidden states during inference and computes a unified confidence score from five independent signals — cheapest first, stops when the verdict is clear.
Three outcomes: ACCEPT (trust it), RETHINK (regenerate), ABSTAIN (“I don’t know”).
The image explains the full pipeline.
12 bytes of state. C89. No floats. No malloc. No dependencies. No cloud. Runs locally on a MacBook Air at 5.8W.
Works with any GGUF model. Available as MCP plugin.
github.com/spektre-labs/creation-os