▲ 34 r/Qwen_AI
I got a meagre 3 tok/sec on a 5060 ti (16gb) with Ryzen 9950x for UD5KXL quant via llamacpp. I feel this can be improved. Did anyone get better speed ? Do share your config as well.
Btw, I get 40 tok/sec for qwen3.6 35b a3b model on same hardware.
u/ConfidentSolution737 — 17 days ago