u/zengoind

Have you actually used 256K/1M context for messy workflow inputs?

​

Most long-context talk still sounds like a chat demo. The uglier test is whether a model can hold a PRD, logs, docs, tests, repo slices, prior outputs, and contradictory notes from earlier runs in one working context without everything turning brittle. That is why Ling-2.6-1T is interesting to me. The official docs say it supports up to 1M native context, while the official API currently exposes 256K. The public materials also keep pairing that with fast thinking and lower token overhead. If that matters in practice, the win is not "it can chat forever." The win is fewer chunk / summarize / stitch passes, less context loss between steps, and less prompt glue holding the workflow together.

Have you tried a long-context model on work like this? PRD + repo + tests, long incident logs, or multi-run agent state with conflicting notes. Where did it actually help you, and where did it still make you clean the mess by hand?

reddit.com
u/zengoind — 19 hours ago

Coding-agent evals should probably score tokens spent per completed task

​

What stands out to me about Ling-2.6-1T is not just that it's a 1T flagship. The official positioning is unusually explicit about efficiency: fast thinking, lower token overhead, and getting from logical reasoning to task execution with minimal compute overhead. That makes me think our evals are still incomplete. For coding agents and automation pipelines, the real question is often how much a model spends before the task is actually done. Token burn, latency across long tool chains, and retry rate all matter once you leave demo mode. A model that is slightly less flashy on prestige benchmarks but materially better on task-completion-per-token could be more valuable in practice than one that looks great in a screenshot and quietly torches your budget.

If you were comparing agent models tomorrow, what would matter more to you: completed tasks per $1, completed tasks per 100k tokens, time to finish a long tool chain, or failure rate after 10 steps ?

reddit.com
u/zengoind — 19 hours ago

I have been playing guitar for over an year now as a hobby ofc. dedicating around 30mins to 1 hour daily on it. (lost touch in between for 3 4 months). Now I can play all chords, including barre chords, transition smoothly between them, know all the popular strummings, and can play a few solos/riffs of my favourite songs after some memorisation and practice. What is the next step from now on? Music theory? Scales? Please help. Feel stuck

reddit.com
u/zengoind — 13 days ago