u/Evildude42

For a couple weeks, I've been struggling trying to get the Ubuntu betas to work. I kept running in the brick walls trying to get Intel drivers to be installed properly, with missing drivers, and missing locations, and yada yada yada.

Today I finally sat down, with the release version, to struggle and installed the Intel llm scaler, since I am using a b50 and a b580. I finally got it to run in docker, without crashing, and the speed difference from what I was running in Windows with LM studio and this running in in Linux is night and day. This is actually usable. Really usaable.

I do not get a speedometer in xcode, so I can't give you what it's doing, but it it is very much faster than what I was getting in LM studio over the network.

So the specifications, I'm using Qwen 3.6 27b q4 as the model, running on the b580 and the b50. At this point I don't have no idea which one is the primary card. I also have a t600 as the output card so that the two Intel cards can use all of their er for the llm and the cache. And if anybody cares, the CPU is a 5800x with 64 gigs of RAM.

reddit.com
u/Evildude42 — 12 days ago