
u/Fit_Transition8824

Denise Holt:🔴 Seed IQ is now at 10/10 games solved on ARC-AGI 3 🥳🙌🏻
This week we’ve had a lot of people suggesting that our posts are representative of our own report/interpretation of scores/performance and that they are somehow “not official.”
We’ve also had accusations of “faking it.”
➡️ Make no mistake, these LIVE Scorecards ARE the OFFICIAL evaluation validated by ARC Prize, themselves, of Seed IQ’s performance. The scorecards sit on the ARC Prize website, generated by them, not us. These details are served up from their end recording & evaluating all of the details of game performance on every level of every game Seed IQ plays. They even include replays of every level.
🔸 It doesn’t get more official than this.🔸
▪️The only thing that is not happening for us it placing Seed IQ on the leaderboard. And that is due to the fact that the ARC Prize rules state that you have to turn over your entire codebase & commercial rights to your system in order to be recognized as a contender on the leaderboard (officially entering the contest portion of the benchmark).
▪️We asked for a private evaluation, we offered to forgo prize money, and Greg Kamradt told us that option wasn’t available at this time.
▪️Yet, they clearly do it for the frontier models. Last week they evaluated both ChatGPT 5.5 (scored 0.43%) and Claude Opus 4.7 (score 0.18%), and he gave a detailed report of what they observed of those models performance on the backend.
▪️After I posted about our 5th game win, Greg commented on X about the steps he observed on the backend of our play, and he asked me what priors we are using.
➡️ They see everything we are doing. They are giving us our OFFICIAL SCORES.
(If this was something you could fake, why don’t you see anyone else posting scores like this? Why wouldn’t the ARC Prize folks be calling us out for cheating? I’ve seen them call out people for spreading misinformation about the contest.)
You would think they would acknowledge Seed IQ’s performance publicly, the same way they do frontier models who clearly aren’t turning over their codebase either, especially because we are the only system acing these challenges and crushing this benchmark.
▪️ARC Prize has positioned themselves as an entity to evaluate the best of AI. They have made it clear in the past that they do not believe DL/RL has any ability to adapt or to reason, plan, and act across novel environments. ARC-AGI 3 was positioned as an effort to spotlight advanced systems who actually can do that, and yet proprietary systems are being ignored while the entire benchmark is catering to DL/RL systems who cannot even score 1% on the challenges.
It begs a much deeper question about the real objective of this benchmark. 🤷🏻♀️
✅ Either way, we’ll keep letting Seed IQ play their games because regardless of the leaderboard, the benchmark is still acting as an official evaluation and validation of its performance. 🥳🚀
LIVE Scorecard for 10/10 games in comments…
#AIX #SeedIQ https://arcprize.org/scorecards/b65d86f3-d36f-43cb-abf9-bfa4e138d7d8
This update highlights Seed IQ achieving 100% scores on ARC-AGI 3 using active inference instead of LLM scaling. It demonstrates superhuman performance by inferring environmental priors rather than using brute force.
Denise Holt: 🔴 ARC-AGI 3 Benchmark Seed IQ UPDATE: 💡 ✅ 8/8 games now, ✅ 60 levels, ✅ 2674 total actions ... ✅ 100% overall score - and look closely at the second image here of our Seed IQ scorecard... We are actually scoring 115% 😯 on most levels. And all of this at 2-3x the human baseline. Superhuman performance across all ARC Prize challenges! 🥳 🥳 🚀
(If you're wondering why we are not on the Leaderboard, it's because we have proprietary IP and the rules state you have to turn over codebase and commercialization rights to be included.)
🔗LIVE Scorecard link: arcprize.org/scorecards/dcf…
...click around in it to see more details and replays. This scorecard is displayed directly from the ARC Prize website.
Seed IQ uses zero tokens, no RL, no central control. This is topological perception under bounded autonomy. Active Inference combined with physics-driven dynamics.
Seed IQ is not getting there by memorizing examples, scaling a foundation model, or brute forcing action sequences.
It is improving because it is getting better at inferring the priors of the environment, or the hidden structure that makes the game solvable in the first place. Those priors are the invariances, constraints, symmetries, affordances, object relations, boundary conditions, and transition rules that define what actions are admissible and what paths can actually close.. Once those priors are inferred correctly, the search space collapses. The system no longer has to explore like RL or sample like a neural network.. It can identify the governing structure of the task and move through the admissible solution manifold directly.
This is why the performance is getting both faster and more deterministic. (Superhuman level) Seed IQ is not just playing better. It is perceiving the structure underneath the game better.
#AIX #SeedIQ #ARC3 #ARCAGI3 #Quantum #DataCenters #EnergySystems
#AIXGlobalInnovations
@GregKamradt
Denis O: Seed IQ topological perception has improved to the point where we are now beating the best ARC AGI 3 human baselines on some of the most complex games available through the API by roughly half while scoring 100%.
In practical terms, Seed IQ is now performing at 2-3× human baseline efficiency, consistently and deterministically. But the important part is not just the score. It is why the score is improving.
Seed IQ is not getting there by memorizing examples, scaling a foundation model, or brute forcing action sequences.
It is improving because it is getting better at inferring the priors of the environment, the hidden structure that makes the game solvable in the first place. Those priors are the invariances, constraints, symmetries, affordances, object relations, boundary conditions, and transition rules that define what actions are admissible and what paths can actually close.. Once those priors are inferred correctly, the search space collapses. The system no longer has to explore like RL or sample like a neural network.. It can identify the governing structure of the task and move through the admissible solution manifold directly.
That is why the performance is now both faster and more deterministic. Seed IQ is not just playing better. It is perceiving the structure underneath the game better.
Meanwhile Greg or the guy running the arc prize is busy squeezing 1% from foundational LLMs with some new cool GPUs they got donated 😁😁😆🤣💀💀🐼
Please see links below
AIX Global Innovations
Denise Holt
\#ai
New game replay
https://arcprize.org/replay/a173a874-eb3f-417f-ac55-d736357d6a57
New scorecard
https://arcprize.org/scorecards/dcf7f8f9-c5a3-44a2-b747-19d2b55e5ade
Denis O. : Seed IQ topological perception has improved to the point where we are now beating the best ARC AGI 3 human baselines on some of the most complex games available through the API by roughly half while scoring 100%.
In practical terms, Seed IQ is now performing at 2-3× human baseline efficiency, consistently and deterministically. But the important part is not just the score. It is why the score is improving.
Seed IQ is not getting there by memorizing examples, scaling a foundation model, or brute forcing action sequences.
It is improving because it is getting better at inferring the priors of the environment, the hidden structure that makes the game solvable in the first place. Those priors are the invariances, constraints, symmetries, affordances, object relations, boundary conditions, and transition rules that define what actions are admissible and what paths can actually close.. Once those priors are inferred correctly, the search space collapses. The system no longer has to explore like RL or sample like a neural network.. It can identify the governing structure of the task and move through the admissible solution manifold directly.
That is why the performance is now both faster and more deterministic. Seed IQ is not just playing better. It is perceiving the structure underneath the game better.
Meanwhile Greg or the guy running the arc prize is busy squeezing 1% from foundational LLMs with some new cool GPUs they got donated 😁😁😆🤣💀💀🐼
Additionally,Please see attached links for video game play and scorecard.
AIX Global Innovations
Denise Holt
#ai https://arcprize.org/replay/a173a874-eb3f-417f-ac55-d736357d6a57
https://arcprize.org/scorecards/dcf7f8f9-c5a3-44a2-b747-19d2b55e5ade
Denis O : Another day, another ARC AGI 3 game, another 100% Seed IQ Win (9/9 levles, 2x human baseline on TU83, 3x on WA30)... Perfect 100% across 5 games.
💯🥳
#ai #aix #seediq