u/PracticlySpeaking

▲ 2 r/oMLX

Trying to convert to oQ — MiniMax

Can anyone help troubleshoot oQ quantization?

If I am reading the log correctly, it appears to have completed dequant of MiniMax M2.7-fp8.

oMLX created a folder under .omlx/models/ but it there is no output and there are no more log messages from omlx.oq following 'FP8 dequant complete' even hours later.

2026-05-13 12:34:52,685 - omlx.oq - INFO - [-] - FP8 dequant complete: 47864 tensors spilled to /var/folders/38/9l6wxxqs1ds3_d0x7pmnbpb00000gn/T/oq_fp8_dequant_w5_qp495
2026-05-13 12:34:52,686 - omlx.oq - INFO - [-] - oQ5: FP8 dequant complete (47864 tensors)
2026-05-13 12:37:16,600 - omlx.process_memory_enforcer - INFO - [-] - Process memory enforcer started (limit: 248.0GB, interval: 1.0s)
reddit.com
u/PracticlySpeaking — 7 hours ago
▲ 2 r/oMLX

oMLX - Gemma 4 and Structured Output (JSON schema)

Any tips or secrets on getting

I spent a couple of days trying to get structured JSON from Gemma-4 models (26B-A3B-it and E4B) with oMLX + Grammar, and could not find solutions for the shape / syntax checking within grammar and returning just an error message.

The model is called from a python script that has some post-processing to deal with occasional malformed JSON (things like object vs array). Using oMLX-cli + Grammar (0.3.8), these fail an internal check that grammar uses so the only return is an error message.

reddit.com
u/PracticlySpeaking — 1 day ago

condensed from full release notes here with issues and GitHub user tags: https://github.com/NousResearch/hermes-agent/releases/tag/v2026.4.30

Release Date: April 30, 2026
Since v0.11.0: 1,096 commits · 550 merged PRs · 1,270 files changed · 217,776 insertions · 213 community contributors (including co-authors)

>The Curator release — Hermes Agent now maintains itself. An autonomous background Curator grades, prunes, and consolidates your skill library on its own schedule. The self-improvement loop that reviews what to save got a substantial upgrade. Four new inference providers, a 18th messaging platform, a 19th via Teams plugin, native Spotify + Google Meet integrations, ComfyUI and TouchDesigner-MCP moved from optional to bundled-by-default, and a ~57% cut to visible TUI cold start.

  • Autonomous Curator — hermes curator runs as a background agent on the gateway's cron ticker (7-day cycle default). It grades your skill library, consolidates related skills, prunes dead ones, and writes per-run reports
  • Self-improvement loop — substantially upgraded — The background review fork – Hermes core self-improvement that decides what memories/skills to save or update – is now rubric-based rather than free-form, biased to present (prefers most recently loaded skill), handles references/templates, and properly inherits the parent's live runtime.
  • Skill integrations — major expansion
    •  ComfyUI v5 with official CLI + REST moved from optional to built-in by default
    • TouchDesigner-MCP bundled by default, expanded with GLSL, and more!
    • Humanizer skill ports a text-cleaner that strips AI-isms.
    • claude-design HTML artifact skill + design-md (Google DESIGN.md spec) + airtable salvage + skill_manage edits in external_dirs + direct-URL skill install + /reload-skills slash command.
  • LM Studio — first-class provider — upgraded from a custom-endpoint alias
  • Pluggable gateway platforms + Microsoft Teams + Tencent (Yuanbao) messaging — the gateway is now a plugin host.
  • Google Meet plugin — join calls, transcribe, speak, follow up.
  • Spotify — native tools + bundled skill + wizard — 7 tools (play, search, queue, playlists, devices) behind PKCE OAuth
  • Four more new inference providers
  • Models dashboard tab + in-browser model config
  • Remote model catalog manifest
  • Native multimodal image routing — images now route based on the model's actual vision capability
  • Gateway media parity — native multi-image sending across Telegram, Discord, Slack, Mattermost, Email, and Signal; centralized audio routing with FLAC
  • TUI catches up to (and past) the classic CLI — LaTeX rendering - /reload .env hot-reload - pluggable busy-indicator styles - opt-in auto-resume of last session - expanded light-terminal auto-detection - session delete from /resume picker - modified mouse-wheel line scroll - a /mouse toggle that kills ConPTY's phantom mouse injection.
  • Observability + achievements plugins — bundled Langfuse observability plugin + bundled hermes-achievements plugin that scans full session history.
  • TTS provider registry + Piper local TTS — pluggable tts.providers.<name>registry; Piper ships as a native local TTS provider. (Closes #8508.) (#17843#17885)
  • Vercel Sandbox backend — Vercel sandboxes as an execute_code/terminal backend
  • Secret redaction off by default — default flipped to off. Prevents the long-standing patch-corruption incidents where fake secret-shaped substrings mangled tool outputs. Opt in when you need it.
  • Cold-start performance — visible TUI cold start cut ~57% via lazy agent init
u/PracticlySpeaking — 13 days ago