
▲ 24 r/ClaudeAI
I ran a loop where each round runs Claude in a sandboxed Docker container with a fresh context window. The key difference is that the goal is objective and verifiable.
When I ran it on a repo, I noticed that in rounds 1-2 it found several independent low-risk vulnerabilities, but from round 3 onward it started chaining them into critical exploits. That emergent behavior is what makes it interesting.
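
For anyone who wants to try something similar, here is a minimal sketch of what a loop like this might look like. It is not the exact setup, just the general shape. The image name, the agent command, the mounted `/work` directory, and the `goal_met` check are all hypothetical placeholders. Since each round starts with a fresh context, the sketch assumes anything that should carry over between rounds is written to files in the mounted directory.

```python
import subprocess
from pathlib import Path
from typing import Callable, List, Sequence


def build_docker_cmd(image: str, workdir: Path, agent_cmd: Sequence[str]) -> List[str]:
    """Build a `docker run` command for one round in a throwaway container."""
    return [
        "docker", "run", "--rm",             # container is deleted after the round
        "-v", f"{workdir.resolve()}:/work",  # assumption: shared dir for the repo and notes
        "-w", "/work",
        image,                               # hypothetical image with the agent installed
        *agent_cmd,                          # hypothetical command that starts the agent
    ]


def run_in_docker(image: str, workdir: Path, agent_cmd: Sequence[str]) -> str:
    """Run one round and return whatever the agent printed to stdout."""
    result = subprocess.run(
        build_docker_cmd(image, workdir, agent_cmd),
        capture_output=True,
        text=True,
        check=False,
    )
    return result.stdout


def run_rounds(
    max_rounds: int,
    run_round: Callable[[int], str],
    goal_met: Callable[[str], bool],
) -> List[str]:
    """Run rounds until the verifiable goal is met or the round limit is reached."""
    outputs: List[str] = []
    for round_no in range(1, max_rounds + 1):
        output = run_round(round_no)   # fresh process, so fresh context, every round
        outputs.append(output)
        if goal_met(output):           # objective check, e.g. an exploit test passes
            break
    return outputs
```

To wire it up you would pass something like `lambda n: run_in_docker("my-agent-image", Path("repo"), ["my-agent-command"])` as `run_round`, with both names swapped for whatever you actually use. Keeping `goal_met` as a separate function is what makes the stop condition objective rather than relying on the model saying it is done.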
u/Efficient-Lychee-100 — 16 days ago