u/ConfidentSolution737

▲ 34 r/Qwen_AI

I'm getting a meagre 3 tok/sec on an RTX 5060 Ti (16 GB) with a Ryzen 9 9950X for the UD-Q5_K_XL quant via llama.cpp. I feel this can be improved. Has anyone gotten better speed? Please share your config as well.

Btw, I get 40 tok/sec with the Qwen3.6 35B A3B model on the same hardware.

u/ConfidentSolution737 — 17 days ago