u/Material-Mention6696

Ive got a 7900xtx with 24gb vram

and want to run hermes with a local llm or using the local llm for the 90-99% of "easier tasks" and routing the hard tasks to a model like kimi k2.6

does somebody have a similiar hardware setup and some tips on what model to choose and how to optimize the hermes setup

how are you guys doing it and what are some general considerations/tips from you?

thanks