4-bit Qwen3.6 MTP GGUF cited 70+ websites with one prompt!
4-bit Qwen3.6 MTP GGUF managed to search 70+ sites from a single prompt.
Try this locally with Unsloth Studio on 20GB RAM.
Unsloth now supports automatic MTP + speculative decoding for supported models. Unsloth also now auto-selects the best MTP settings for your specific device (Mac, CPU, GPU etc.)
We also fixed many bugs and issues including tokens/s not showing up correctly and MTP not being applied properly.