u/Anony6666

Someone just shipped an open reasoning-distilled Qwen3.6-35B-A3B, fine-tuned to imitate Claude Opus 4.7’s chain-of-thought: - 35B MoE, ~3B active/token → fits on one A100/H100 - Thinks in <think>...</think> like the teacher - Apache 2.0, weights + dataset both public
🔥 Hot ▲ 187 r/huggingface+2 crossposts

Someone just shipped an open reasoning-distilled Qwen3.6-35B-A3B, fine-tuned to imitate Claude Opus 4.7’s chain-of-thought: - 35B MoE, ~3B active/token → fits on one A100/H100 - Thinks in <think>...</think> like the teacher - Apache 2.0, weights + dataset both public

u/Anony6666 — 4 days ago