u/Apart-Medium6539

I gave AI agents eyes on my PC

I built Pupil, an open-source tool.

The pain point: too many screenshots sent to AI tools just to ask where to click.

Now the agent can inspect the UI, point at the target, and wait for approval.

Feedback welcome.

reddit.com
u/Apart-Medium6539 — 4 days ago

Pupil: I gave ChatGPT eyes on my PC

I built Pupil, an open-source tool for AI agents.

Instead of uploading screenshots to ask where to click, the agent can inspect the app, highlight the target, and wait for approval.

Github

github.com
u/Apart-Medium6539 — 4 days ago
▲ 9 r/Agent_AI+5 crossposts

I gave ChatGPT eyes on my PC, now it can show me where to click

I kept sending screenshots to ChatGPT just to ask “where do I click?”

So I built Pupil, an open-source tool that lets an AI agent inspect desktop apps, point at the right place, and wait for approval before clicking. (tab to approve)

Demo: I ask it to find Discord’s data/privacy settings.

Looking for feedback on the idea and UX.

Github

u/Apart-Medium6539 — 4 days ago