
AI 2027 is 88% accurate so far
ex: AI 2027 projected the frontier CyBench score to be 85% by now -- yet Claude Opus 4.6 and Mythos score 100%. It projected OSWorld at 80% -- yet Mythos scores 79.6%. It projected AI to clear 8-hour tasks on RE-Bench -- yet Mythos clears 8 hours on Anthropic's internal RE-Bench.
u/gbomb13 — 21 hours ago