u/misanthrophiccunt

▲ 3 r/LLMStudio+1 crossposts

Is there a place where I can compare generation of tokens per second of 1 GPU VRAM+RAM vs 2 GPUs for those models that don't fit in 1 GPU?

u/misanthrophiccunt — 5 days ago