u/LearnedByError

▲ 9 r/oMLX

What Works for Coding on an M5 with 24GB of Universal Ram

I am new oMLX and relatively new to local LLMs. I have been trying to get Qwen 3.5, Qwen 3 or Gemma 4 running on my M5 with 24GB of universal ram using oMLX. I have tested a number of models from the mlx-community in the size range of 13 - 15GB. To date, they all blow after a few minutes of starting a task with OOM.

I would appreciate hearing what you have working for coding on a Mac with 24GB of RAM.

Is oMLX the best way to run it? I've been trying, hoping may be a better word, to find a model with TurboQuant that will handle the run of the mill dev tasks to help minimize my cost for the larger models.

Thank you in advance! lbe

reddit.com
u/LearnedByError — 3 days ago
▲ 1 r/cursor

TDD on long running taks - err I mean TDD abandoned ...

Have you found that Cursor will no long follow a TDD plan past a couple of cycles on long running tasks? It has been fighting me for the past 4 hours. I'm about read to head to less brown pastures.

reddit.com
u/LearnedByError — 3 days ago

I recently posted Z.ai 429 telling about my problem with their new rate limits.

I am looking for recommendations for API plans going forward. I am probably a medium consumer from a token perspective. Until the recent changes, the $39/month plan met my needs with me barely bumping into the 5 hour quota every now and then.

I mostly write and maintain Go with occasional forays into C, Web (html,js,ts) ands various scripting languages. I was just getting up to speed with Pi, so I still have Cursor, Copilot and Kimi Code to fall back on but would really like to get back to Pi. It fits my minimalist mentality.

Any stories about what you are using and recommendations will be appreciated.

reddit.com
u/LearnedByError — 10 days ago
▲ 14 r/ZaiGLM+1 crossposts

Has anyone else encountered this error when using Z.ai GLM-5.1 on their coding plan?

Error: 429 Your account's current usage pattern does not comply with the Fair Usage Policy, and your request frequency has been limited. For details, please refer to 

the Subscription Service Agreement. To restore access, please submit a request.

I created a ticket with them and received a long automated response 2 days later. The core of it is the following list of common reasons for account suspension:

  1. Using unofficial methods to invoke the Coding plan: Other third-party tools, self-made tools that are not introduced in the official tutorial may consider as a violation of usage rules.
  2. Abnormally high-frequency requests: Sending an extremely large number of requests in a short period will be flagged as a malicious attack, resulting in an account ban.
  3. Account sharing: Suspicious activities indicating that multiple users are sharing a single account
  4. Unauthorized reselling: Accounts suspected of selling or transferring Coding plan quotas without authorization.

For 1.: Pi is not listed on the official tutorial

For 2: I have hit my 5 hour or weekly quota a few times in the 3 months. that I have used Z.ai GLM; however, there are also days that go by where I don't use it at all. I have come no where near hitting my monthly quota.

I have not engaged in 3 or 4 at all.

My accounts rebills for the next period on May 3, tomorrow. If this is not addressed today, which it probably won't since it is Saturday, I'll be moving to a different provider.

u/LearnedByError — 12 days ago