Qwen3.6-35B-A3B-MTP on an RTX 3090 in LM Studio is incredibly fast
The LM Studio support for MTP just got released literally this hour.
I'm getting 100 tok/s generation speeds on a Q4_K_M quant of Qwen3.6-35B-A3B-MTP, at full context size on my RTX 3090, in LM Studio, on Windows 10.
Try it yourself. It's incredible that it's even faster than Qwen3.5-9B at Q6_K, with which I got 79 tok/s.