u/Material-Mention6696

Best Local LLM for 24GB of VRAM?

I've got a 7900 XTX with 24 GB of VRAM and want to run Hermes with a local LLM, or use the local LLM for the 90-99% of "easier" tasks and route the hard tasks to a model like Kimi K2.6.

Does anybody have a similar hardware setup and some tips on which model to choose and how to optimize the Hermes setup?

How are you guys doing it, and what general considerations or tips do you have?

Thanks!

u/Material-Mention6696 — 9 hours ago