
▲ 19 r/SillyTavernAI
from RP-Bench at https://arena.l3vi4th4n.ai/results
Opus 4.7 probably is correct, and I've been having fun with DS v4 Pro, but no way GLM 5.1 is dead last surely?
I guess this just means there's no chance in hell we'll ever have good benchmarks for anything.
u/Dead_Internet_Theory — 14 days ago