u/Jazzlike_Cap9605

So I've been running gpt and glm-5.1 side by side lately and tbh the gap is way smaller than what I'm paying for would suggest

On SWE-Bench Pro glm-5.1 actually took the top spot globally, beating gpt-5.4 and opus 4.6. Overall coding score is like 55 vs gpt-5.4 at 58. Didn't expect that from an open-source model ngl

Switching between them during the day I honestly can't tell which one did what half the time. Debugging, refactoring, multi-file stuff: both just handle it

GPT still has that edge when things get really complex tho, like deep system design stuff where you need the model to actually think hard. That's where I notice the difference

For the regular grind tho it's hard to care about a 3 point gap when my tokens last way longer lol. And they got here stupid fast compared to the 'Thinking' delays, which is the part that gets me

Running gpt and glm-5.1 side by side. Honestly can’t tell the difference