u/Efficient-Lychee-100 — reddlx

https://preview.redd.it/g98j5txd7sxg1.png?width=936&format=png&auto=webp&s=df75bc132f57cc14ba04cdd06257ba997b9bbb0b

Ran a loop where each round runs Claude in a sandboxed Docker container with a fresh context window. The key difference is that the goal is objective and verifiable.

When I ran it on a repo, I noticed that during rounds 1-2, it found several independent low-risk vulnerabilities, but then, from round 3 onward, it started chaining them into critical exploits. This emergent behavior makes it very interesting.

Repo: https://github.com/SignalPilot-Labs/AutoFyn