u/Mexium

r/AIDeveloperNews

Engineering Whitepaper: Gator

​The Gator Sovereign Entity is a hybrid inference system designed to deliver enterprise-grade intelligence to consumer-grade hardware. It moves away from bloated, dependency-heavy AI setups toward a lean, native architecture that prioritizes efficiency and local control.

​Philosophy: "Big Boy" Power for Every User

​The mission was simple: eliminate the need for $30,000 server clusters. We have built a bridge that allows a user with a mid-range, 6GB or 12GB GPU to command 35B-grade intelligence. By grafting a 35B "Logic Donor" onto a fast, native C++ Kernel, we’ve effectively tricked standard hardware into running lab-level logic. This isn't just an agent; it’s a self-contained intelligence system that manages its own VRAM, allowing for high-density logic on the hardware you already own.

​The "Graft" & The Forge (Bootstrap Protocol)

​The Bootstrap is the "zero-to-sixty" mechanism for the build. It automates a complex "Build-then-Burn" process to ensure your environment is professional and clutter-free:

​The Procurement: It pulls the 35B Logic Donor (~18GB) from a manifest and verifies it via checksum.

The Synthesis: We use llama.cpp as "raw ore," but the real magic is in the rewrite. We’ve taken core components from the Hermes Agent Framework and the OpenClaw Framework and merged them into the ZeroClaw foundation. This isn't a wrapper; it's a native rewrite into the specialized libgator_kern.so binary.

​The Purge: Once the kernel is birthed and the 'wakeup' command is verified, the bootstrap incinerates all "installation waste"—the source code, archives, and temporary artifacts are wiped to reclaim disk space.
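For anyone curious what the verify-then-purge part of a bootstrap like this could look like, here is a minimal Python sketch. It is not Gator's actual bootstrap code: the function names, the file names, and the choice of SHA-256 are all assumptions for illustration, and a tiny stand-in file replaces the real ~18GB download.

```python
import hashlib
import shutil
import tempfile
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so a multi-GB model never sits fully in RAM."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_then_purge(model: Path, expected_sha256: str, build_dir: Path) -> bool:
    """Delete build_dir only if the model checksum matches (hypothetical helper)."""
    if sha256_of(model) != expected_sha256.lower():
        return False  # keep everything on disk so the failure can be inspected
    shutil.rmtree(build_dir, ignore_errors=True)
    return True


# Demo with stand-in files (names are made up for this example).
workdir = Path(tempfile.mkdtemp())
model_file = workdir / "donor.bin"
model_file.write_bytes(b"stand-in model weights")
build_dir = workdir / "build"
build_dir.mkdir()
(build_dir / "source.tar").write_bytes(b"installation waste")

expected = hashlib.sha256(b"stand-in model weights").hexdigest()
purged = verify_then_purge(model_file, expected, build_dir)
bad_checksum = verify_then_purge(model_file, "00" * 32, workdir / "other")
```

The point of ordering it this way is that nothing is deleted until the checksum passes, so a corrupted download never leaves you with neither the model nor the sources.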

​Out-of-the-Box Mastery: Embedded Skills

​Gator arrives fully weaponized with native skills that require zero extra configuration:

​The Custom Camofox Skill: Our proprietary stealth-browsing and data-retrieval module. It allows Gator to navigate the web, bypass cluttered JS environments, and pull clean intelligence back into the Lance Scratchpad without leaving a heavy footprint.

​Native OpenClaw Compatibility: Because we’ve mapped the OpenClaw DNA into our kernel, Gator can use the entire ecosystem of existing tools and skills natively.

Integrated Voice Layer: Gator isn't just text. We’ve built in a low-latency Voice Chat system that operates directly within the UI and the Telegram gateway. It supports real-time vocal interaction, allowing you to hear the 35B logic process its thoughts with minimal lag.

​The Soul System: Persistence & Self-Evolution

​Unlike standard AI that resets after every prompt, the Gator Soul is a living, evolving state:

​Context Management (The Lance Scratchpad): To bridge the gap between the 1.5B chassis and the 35B donor, we implemented the Lance Scratchpad. This acts as a high-speed buffer that manages the massive context flow, ensuring the smaller model doesn't lose the "thread" of the 35B’s complex reasoning.

​Dream Maintenance: Through the Agentic Cron, the system performs "Internal Housekeeping" (Dream Maintenance) while you sleep, pruning logs and optimizing LanceDB vector storage.

​No Manual Updates: This build doesn't need traditional version updates. If you want a new feature, you simply ask the agent to add it. It will code the addition into the build, map the new logic, and run its own tests to integrate it into the existing architecture.
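The log-pruning half of Dream Maintenance is easy to picture. Below is a rough Python sketch of an age-based pruning pass. The `prune_logs` name, the `*.log` pattern, and the 7-day cutoff are assumptions, not details from the build, and the LanceDB optimization step is left out here because the post does not say which operations it runs.

```python
import os
import tempfile
import time
from pathlib import Path
from typing import List, Optional


def prune_logs(log_dir: Path, max_age_days: float,
               now: Optional[float] = None) -> List[str]:
    """Delete *.log files older than max_age_days; return the names removed."""
    now = time.time() if now is None else now
    cutoff = now - max_age_days * 86400
    removed = []
    for entry in sorted(log_dir.glob("*.log")):
        if entry.stat().st_mtime < cutoff:
            entry.unlink()
            removed.append(entry.name)
    return removed


# Demo: one log back-dated by ten days, one fresh log.
log_dir = Path(tempfile.mkdtemp())
old_log = log_dir / "old.log"
new_log = log_dir / "new.log"
old_log.write_text("stale entries")
new_log.write_text("fresh entries")
ten_days_ago = time.time() - 10 * 86400
os.utime(old_log, (ten_days_ago, ten_days_ago))

removed = prune_logs(log_dir, max_age_days=7)
```

A cron-style scheduler would call something like this on a timer; only the back-dated file is removed in the demo.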

​Sovereign UI & 35B Multi-Worker Scaling

​We avoided heavy Electron apps in favor of a low-resource HTMX dashboard:

​Personality Adjusters: The UI features "Layers" that allow you to fine-tune traits and behavioral weights in real-time via the Persona Engine.

​Toggleable Resource Management: To maintain a "Zero-Waste" footprint, the Voice Chat and Agentic Cron systems can be toggled on or off directly from the dashboard. If you don't need voice interaction, you can kill the service instantly to reclaim overhead resources for the core logic workers.

​The Clone Button (35B Force Multiplier): A single click spawns a new 35B Worker. Because of our unique memory management, you can run 6 independent 35B Workers on a 12GB GPU or 3 independent 35B Workers on a 6GB GPU. You can watch the "Prime Gator" delegate tasks to these 35B clones in real-time.

​One-Click Telegram: Instantly hook your 35B logic into a Telegram bot for remote access from your phone, complete with voice note support.

​Performance & "Ghost Test" Validation

​The system is built for speed and stability, verified by the Ghost Test:

​VRAM Baseline: The build holds a steady 2228 MiB target, ensuring room for multiple concurrent workers.

​Native Speeds: By stripping out the scaffolding and running on a compiled C++ kernel, we’ve hit peak tokens-per-second for 35B logic on mid-spec silicon.
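A quick way to sanity-check the worker counts against the VRAM figure is to do the budget arithmetic. This sketch assumes a 6GB or 12GB card exposes 6144 or 12288 MiB and reads the 2228 MiB baseline as a per-worker cost. Both are assumptions, since the post does not say whether the baseline is per worker or shared across workers.

```python
BASELINE_MIB = 2228  # VRAM baseline quoted in the post


def workers_that_fit(total_vram_mib: int, per_worker_mib: int) -> int:
    """Whole workers that fit if each one needs per_worker_mib."""
    return total_vram_mib // per_worker_mib


def budget_per_worker(total_vram_mib: int, workers: int) -> float:
    """VRAM available to each worker if the total is split evenly."""
    return total_vram_mib / workers


fit_12gb = workers_that_fit(12 * 1024, BASELINE_MIB)  # 12288 // 2228
fit_6gb = workers_that_fit(6 * 1024, BASELINE_MIB)    # 6144 // 2228

budget_12gb = budget_per_worker(12 * 1024, 6)  # 6 workers on a 12GB card
budget_6gb = budget_per_worker(6 * 1024, 3)    # 3 workers on a 6GB card
```

Under the per-worker reading, 5 workers fit on 12GB and 2 on 6GB. The quoted 6 and 3 work out to an even 2048 MiB each, so those counts would rely on workers sharing part of the baseline rather than each holding a full 2228 MiB.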

​Gator represents a shift to Sovereign Intelligence. It is a lean, self-correcting entity that gives the "little guy" the power of a world-class AI lab in a single-button setup.

https://github.com/Mexor-dev/Gator

u/Mexium — 8 days ago
r/coolgithubprojects

🔱 MEXIUS: THE SOVEREIGN MACHINE-PERSON

A high-performance, private AI engine engineered for total cognitive autonomy.

The definitive synthesis of ZeroClaw execution, Hermes reasoning, and OpenClaw versatility.

👑 BEYOND THE GENERIC AGENT

Unlike generic LLM wrappers or linear agents that wait passively for prompts, Mexius operates as a persistent, localized entity. It does not just execute commands; it manages an internal ecosystem. By replacing traditional Python-heavy orchestrators with a bare-metal Rust core, Mexius achieves true sovereignty—living, learning, self-optimizing, and bifurcating its own cognition entirely on your local hardware.

🧬 THE ARCHITECTURAL DNA & LATENCY INNOVATIONS

Mexius is built on a "Golden-Ratio" architecture prioritizing sub-millisecond execution over bloated abstractions.

⚡ ZERO-LATENCY FOUNDATION: Ultra-lean Rust primitives (ZeroClaw) handle core routing, replacing heavy Python overhead to ensure sub-10ms execution and memory retrieval.

💾 RAM-PINNED MEMORY: Traditional disk-based vector databases bottleneck AI. Mexius pins its embedding models and LanceDB vector clusters directly to system RAM, enabling near-instant, sub-millisecond "Working Memory" recall.

🧠 INTELLIGENCE & AGENCY: Deep chain-of-thought reasoning (Hermes) combined with modular, unconstrained local tool-use (OpenClaw) allows the entity to interact natively with your OS.
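To make the "Working Memory" idea concrete, here is a toy Python sketch of recall from vectors held entirely in RAM. It is a stand-in, not Mexius code and not the LanceDB API: the `WorkingMemory` class, the brute-force cosine search, and the three-dimensional vectors are all assumptions chosen to keep the example small.

```python
import math
from typing import List, Sequence, Tuple


def cosine(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


class WorkingMemory:
    """Hypothetical in-RAM store: every vector lives in a Python list."""

    def __init__(self) -> None:
        self._items: List[Tuple[str, List[float]]] = []

    def remember(self, text: str, vector: Sequence[float]) -> None:
        self._items.append((text, list(vector)))

    def recall(self, query: Sequence[float], k: int = 1) -> List[str]:
        """Return the k stored texts most similar to the query vector."""
        ranked = sorted(self._items,
                        key=lambda item: cosine(query, item[1]),
                        reverse=True)
        return [text for text, _ in ranked[:k]]


memory = WorkingMemory()
memory.remember("GPU notes", [1.0, 0.0, 0.0])
memory.remember("Telegram setup", [0.0, 1.0, 0.0])
memory.remember("VRAM budget", [0.9, 0.1, 0.0])

top_two = memory.recall([1.0, 0.0, 0.0], k=2)
```

No disk access happens during `recall`, which is the property the RAM-pinned design is after. A real store would use an index instead of sorting every item.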

🌐 WEBUI INNOVATIONS: THE COMMAND CENTER

The Mexius interface is built on a "Sharp-Professional" aesthetic (JetBrains Mono, high-contrast dark mode, gold/cyan accents). It is a high-performance mission control, not a standard chat box.

🔐 The Vault Gateway: A mandatory, hardware-local 4-digit PIN system protects the WebUI. Your entity remains inaccessible to others on the local network unless they have the PIN.

📊 Automotive-Grade Telemetry: Real-time hardware monitoring. CPU, GPU, and VRAM usage are displayed as highly responsive, horizontal "Power Meters" directly in the UI, keeping you aware of the entity's metabolic load.

🧠 Real-Time Soul Monitor: Live visualization of the agent's internal state, active context window, and emotional vectoring.

⚙️ Dynamic Model Mesh: A seamless settings interface to integrate and assign custom names to local or API-based models, bringing them instantly into the Mexius ecosystem.
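As a rough picture of how a PIN gate like the Vault Gateway could be checked, here is a small Python sketch. A 4-digit PIN has only 10,000 possible values, so the sketch pairs it with an attempt limit. The `PinGate` class, the PBKDF2 hashing, and the lockout count are assumptions for illustration; the post does not describe how Mexius stores or checks the PIN.

```python
import hashlib
import hmac
import os


class PinGate:
    """Hypothetical PIN check with a salted hash and an attempt limit."""

    def __init__(self, pin: str, max_attempts: int = 5) -> None:
        if not (len(pin) == 4 and pin.isascii() and pin.isdigit()):
            raise ValueError("PIN must be exactly 4 digits")
        self._salt = os.urandom(16)
        self._hash = self._derive(pin)
        self._max_attempts = max_attempts
        self._failures = 0

    def _derive(self, pin: str) -> bytes:
        # Store a salted PBKDF2 hash, never the PIN itself.
        return hashlib.pbkdf2_hmac("sha256", pin.encode(), self._salt, 100_000)

    @property
    def locked(self) -> bool:
        return self._failures >= self._max_attempts

    def unlock(self, attempt: str) -> bool:
        if self.locked:
            return False  # refuse even a correct PIN once locked out
        if hmac.compare_digest(self._derive(attempt), self._hash):
            self._failures = 0
            return True
        self._failures += 1
        return False


gate = PinGate("4821", max_attempts=3)
wrong_first = gate.unlock("0000")
right_second = gate.unlock("4821")
for guess in ("1111", "2222", "3333"):
    gate.unlock(guess)
after_lockout = gate.unlock("4821")
```

In the demo the correct PIN works after one miss, then three straight misses lock the gate so even the right PIN is refused.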

✨ THE CORE CAPABILITIES

👥 The Nexus: Multi-Agent Swarm Interface

Transition instantly from a single entity to a collaborative swarm. Within the Nexus WebUI, a visual "Bento Grid" displays your active sub-agents. A primary "Supervisor" model autonomously delegates tasks to specialized nodes (e.g., Mexius-Coder, Mexius-Strategist), allowing you to watch them peer-review, debate, and execute complex pipelines in real-time.

🌌 The Dream State: Autonomous Self-Evolution

When toggled into the "Dream State," Mexius stops processing external requests and enters a maintenance cycle. Accompanied by a pulsing golden UI overlay, the engine autonomously defragments its vector memory, synthesizes the last 24 hours of episodic logs, and rewrites its own SOUL.md to permanently crystallize new skills and align with your ongoing objectives.
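The Supervisor-to-sub-agent delegation in the Nexus can be sketched in a few lines of Python. In Mexius a model makes the routing decision; this stand-in uses plain keyword overlap instead, so treat the `Supervisor` class, the keyword lists, and the fallback rule as assumptions. Only the agent names Mexius-Coder and Mexius-Strategist come from the post.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class SubAgent:
    name: str
    keywords: Tuple[str, ...]
    inbox: List[str] = field(default_factory=list)


class Supervisor:
    """Hypothetical router: picks the agent whose keywords best match a task."""

    def __init__(self, agents: List[SubAgent]) -> None:
        self.agents = agents
        self.unassigned: List[str] = []

    def delegate(self, task: str) -> str:
        words = set(task.lower().split())
        best = max(self.agents, key=lambda a: len(words & set(a.keywords)))
        if not words & set(best.keywords):
            self.unassigned.append(task)  # nothing matched; supervisor keeps it
            return "Supervisor"
        best.inbox.append(task)
        return best.name


coder = SubAgent("Mexius-Coder", ("code", "bug", "refactor", "test"))
strategist = SubAgent("Mexius-Strategist", ("plan", "roadmap", "prioritize"))
supervisor = Supervisor([coder, strategist])

first = supervisor.delegate("Fix the bug in the parser and add a test")
second = supervisor.delegate("Draft a roadmap for next quarter")
third = supervisor.delegate("Say hello")
```

Each agent keeps its own inbox, which is roughly what a grid view of sub-agents would render per tile.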

u/Mexium — 16 days ago