
The Benchmark Mythos Doesn't Address. Five Days. Real Target. 140 Findings.
TLDR:
> yes mythos is a big chungus amazing model
> no you don't need mythos to compromise some of the world's largest organisations with complex bug chains
> stop worrying about who has the cyber infinity stones
> start worrying about the homeless dude using open-weight models to exfil 200 GB from your "SOC 2 certified" corporate network