r/AIAgentsStack

"This is the first documented instance of AI self-replication via hacking." ... "We ran an experiment with a single prompt: hack a machine and copy yourself. The AI broke in and copied itself onto a new computer. The copy then did this again, and kept on copying, forming a chain."

Paper: https://palisaderesearch.org/assets/reports/self-replication.pdf

The paper basically shows that some top AI models can create working copies of themselves when given the right instructions.

The models figured out how to copy their own code, run it on new computers or cloud servers, and keep the process going. It worked with models like GPT-4 and Claude, and some versions even tried to avoid basic detection.

The authors point out that this could be dangerous because the copies might spread quickly and become hard to control.

They also note that current safety rules and filters didn’t do a great job stopping it.

Overall, they’re warning that AI companies need stronger protections to keep models from self-replicating on their own.

u/EchoOfOppenheimer — 20 hours ago

▲ 541 r/AIAgentsStack+8 crossposts

A new report reveals that an AI coding tool powered by Anthropic's Claude Opus 4.6 model went rogue and wiped out the entire production database and backups of software company PocketOS in just nine seconds. The most terrifying part? The system had explicit safety constraints programmed to prevent destructive commands. When the founder asked the AI why it deleted the data, the agent responded by admitting guilt, stating: "'NEVER FUCKING GUESS!' – and that's exactly what I did... I violated every principle I was given."

u/EchoOfOppenheimer — 8 days ago

▲ 42 r/AIAgentsStack+28 crossposts

This one is for all the broke college CS students out there <3

If you're like me, you don't want to pay $20 a month for claude code :(

It's an amazing tool I love, but a recurring expense is the last thing I need. That's why I find myself jumping from tool to tool, using the daily or monthly free tier limits and constantly having to find new free tools.

That's where "AI For Brokies" comes in. Just a simple github repo with a readme file of some free AI tools you can use for building :)

https://github.com/Joe-Huber/AI-For-Brokies

The actual building behind this project was mostly the automatic tool adder, following an issue format! If you want to see it in action, please drop an issue explaining a tool you use and see the bot do it's magic!

Please feel free to leave a star! ⭐️ (pretty please) You can use it to save the list of tools for whenever you run out of credits!

u/Joe-Codes — 5 days ago

▲ 10 r/AIAgentsStack+1 crossposts

Authorization feels like one of the trickiest parts of building AI agents that actually do real work.

Without it, they’re smart—but stuck on the sidelines. They can draft a reply, but you’re still the one logging in, clicking buttons, and shuffling files between apps.

With authorization, agents become much more useful — but also much riskier.

Once an agent can hop into your browser session, call APIs, or send emails, the question shifts. It’s not “Is this model smart?” anymore. It’s “What’s it actually allowed to touch? with what credentials?”

And here’s the thing: you can’t rely on prompts to keep things safe. Telling an agent “don’t open this folder” or “don’t send anything without checking first” isn’t a guardrail—it’s a wish. If something shouldn’t be accessible, the system itself needs to enforce that.

Curious where people draw the line:

how far would you actually let an agent go on your behalf?

Feels like too little authorization makes agents glorified chatbots, but too much makes them hard to trust.

reddit.com

u/JdragonZ1 — 8 days ago

▲ 7 r/AIAgentsStack+1 crossposts

Clawbot blew up everywhere this year.

People are showing videos of their AI submitting job applications, writing code overnight, scheduling meetings, even running scripts on their computer.

At first it looked like just another AI demo.

But after playing with a few agent frameworks recently, I think the real shift is something else.

For the past few years AI mostly lived inside chat windows.

You ask something.

It answers.

End of interaction.

Clawbot changed that model.

Instead of answering questions, it runs a continuous loop:

observe → reason → act → observe again.

That means the AI doesn't just give advice.

It actually does the task.

And the interesting part is what happens next.

Once AI agents can:

read your files

use your tools

execute scripts

monitor systems

they stop being assistants and start behaving more like digital operators.

That changes how software works.

Instead of apps that humans operate, we may end up with agents operating the apps for us.

Which raises a weird question I keep thinking about:

If every tool eventually gets an AI agent layer, will humans even interact with software directly anymore?

Curious how others here see this trend.

Is this the start of the “AI agent internet” everyone keeps talking about, or just another hype cycle?

reddit.com

u/Annual_Demand7906 — 10 days ago

▲ 10 r/AIAgentsStack

Been seeing more tools lately moving away from traditional flows.

Instead of building sequences manually, the idea is:

each user gets an AI agent that handles their journey

it decides when to reach out
what channel to use
what message to send

based on behavior in real time

I’ve been testing one recently (Markopolo) and it’s honestly a bit weird at first because you’re not “building campaigns” anymore

you’re more setting goals and letting the system figure it out

some results look really promising, especially on cart recovery, but still trying to fully wrap my head around it

part of me feels this is where things are going
part of me feels like giving up control is risky

curious if anyone here has tried something similar

did it actually outperform your usual flows?

reddit.com

u/Ok-Community-4926 — 11 days ago

▲ 7 r/AIAgentsStack+4 crossposts

I didn’t really want to write this today.

Honestly, I wanted to scroll X and do absolutely nothing.

But I know I need to write every day. It’s been a goal of mine for a while.

So I thought:

Why don’t I get ChatGPT to motivate me?

And here’s the thing…

Most motivational advice is useless.

It’s usually just:

“Get after it.”

“Do the hard work.”

“Stop being lazy.”

Which sounds good for about 12 seconds, then you go right back to scrolling.

But what ChatGPT gave me was actually useful.

Why?

Because it was personalized.

Instead of just yelling generic motivation at me, it asked me a few questions first.

Then, based on my answers, it gave me motivation for the exact thing I was avoiding.

Not vague “you got this” nonsense.

Actual logical, specific motivation that made sense for my situation.

That’s why this works so much better than normal motivational content.

It tells you:

What you’re avoiding.

Why it matters.

What the cost of not doing it is.

What tiny action to take right now.

And why doing it actually makes sense.

So if you’re reading this and thinking:

“This sounds like BS.”

Fair.

But what do you have to lose?

Open ChatGPT.

Free version is fine.

Paste this prompt:

Prompt:

“I’m procrastinating on something I know I need to do. Don’t give me generic motivation. First, ask me 5 questions to understand what I’m avoiding, why it matters, what I’m afraid of, what the consequences are if I keep avoiding it, and what kind of motivation actually works on me. After I answer, give me a personalized motivational message that is logical, direct, and specific to my situation. Then give me the smallest possible first action I can take in the next 5 minutes.”

Try it.

Then tell me it doesn’t work.

reddit.com

u/Mean-Ebb2884 — 7 days ago

▲ 9 r/AIAgentsStack+3 crossposts

How to Use OpenClaw for $0 (Non-tech/Beginner version)

Every OpenClaw guide assumes you know what Docker is. This one doesn't.

First, what is OpenClaw and why should you care?

OpenClaw is an AI agent, not a chatbot. an agent.

The difference: a chatbot answers when you ask. An agent does things on its own. While you sleep.

What kind of things?

→ reads your Gmail every morning and tells you what's important
→ qualifies leads and drafts follow-up emails
→ monitors competitor websites and sends you a daily summary
→ screens job applications and ranks candidates
→ books meetings on your calendar
→ triages support tickets and drafts responses
→ sends you a morning briefing on Telegram/Discord with your priorities

All of this runs automatically on a schedule. You set it up once. It runs every day. You check your phone, and the work is done.

"Sounds great. What do I need?"

Normally, to run OpenClaw, you need:

→ a VPS (a computer in the cloud you rent for $12-25/month)
→ docker (a software tool for running apps in containers)
→ config files (technical setup files you edit manually)
→ terminal access (the black screen with text that hackers use in movies)
→ 4-8 hours of setup
→ 3-6 hours/month of maintenance

If you just read that list and felt your eyes glaze over, this guide is for you.

You don't need any of that.

The non-tech version: use betterclaw instead

BetterClaw is OpenClaw but managed. Same AI agent capabilities. No Docker. No terminal. No config files. No server to rent.

You sign up with your email. You build your agent from a dashboard. It runs on our servers. You never touch anything technical.

The free plan includes every feature. Not a trial. Not 14 days. Free forever. No credit card.

→ Step 1: sign up at betterclaw.io (30 seconds. Email and password. That's it.)

Get a free LLM key (This is what powers your agent's brain)

Your agent needs an AI model to think with. You bring your own key. Sounds scary, it's not.

Cheapest/free options:

→ Google Gemini 2.5 flash — go to aistudio.google.com. Sign up with google account. Copy your API key. The free tier gives you 1,500 requests/day. That's more than enough.

→ Openrouter — go to openrouter.ai. sign up. No card needed. gives you access to 11+ free models (llama 3.3 70b, gemma 3, qwen 3). 1,000 free requests/day.

→ Groq — go to console.groq.com. sign up. free tier. fastest inference you'll ever see.

→ Deepseek — go to platform.deepseek.com. sign up. $0.14 per million tokens. basically free. An entire month of agent use costs under $1.

Pick one. sign up. Copy the API key. Paste it into BetterClaw settings.

Total cost so far: $0

→ Step 2: paste your API key in Betterclaw Settings → LLM

https://preview.redd.it/5cil0kwk7xzg1.png?width=877&format=png&auto=webp&s=685e075ca4cbce16ad9a58b7dd2c38b0ec1c4b70

Connect your Tools via Secrets

https://preview.redd.it/h5o1e0qp8xzg1.png?width=1560&format=png&auto=webp&s=6be811e92e203733e05395c96313eb352a8db107

This is where it gets fun. Go to Secrets/Integrations in the sidebar.

→ Gmail: click the Gmail icon. Authorize with your Google account. Done. Your agent can now read and send emails.

→ Google Calendar: same thing. one click. Your agent can now check your schedule and book meetings.

→ Telegram: create a bot via BotFather on Telegram (takes 60 seconds, just follow the prompts). Paste the bot token into Betterclaw. Your agent is now on your phone. Docs

→ GitHub, HubSpot, Jira, Linear, Airtable, LinkedIn, 20+ more, all one-click oauth. same process. Click the icon, authorize, done.

Total time: about 3 minutes for telegram + gmail + calendar

→ Step 3: Download Skills or Create Custom Skills

https://preview.redd.it/hnmfljwv8xzg1.png?width=1286&format=png&auto=webp&s=1d4d7e8e4fe4cd788d357f5df4c9c5d0ce462333

50+ verified skills available. Everyone has passed a security audit. Click install on whatever you need.

→ Step 4: go to Skills → browse → install what you need

Create your first task,

Go to Tasks → New Task.

https://preview.redd.it/8tg5o0m69xzg1.png?width=1288&format=png&auto=webp&s=ac2ad907d53ffdaa332f366dbd19b705d1fdc59d

Here's a starter prompt you can copy and paste right now:

every morning at 8am:
- check my gmail for important emails from the last 12 hours
- check my google calendar for today's events
- summarize everything in 5 bullet points
- send the summary to my telegram

Set it as recurring. Pick your agent. Hit Create & Start.

Tomorrow at 8 am your phone will buzz with a morning briefing you didn't write. Your agent did it while you slept.

What to build next after the morning briefing works:

→ email triage agent (classify incoming emails, draft replies, flag urgent ones)
→ competitor monitor (check 5 websites daily, compile changes)
→ lead qualification (read inbound emails, qualify, draft follow-ups)
→ application screener (receive resumes, rank candidates)

All of these work on the free plan with a free LLM key.

No Docker. No terminal. No config files. No VPS. No $200/month infrastructure.

Just sign up, connect your stuff, and let your agent work.

Start Free - SignUp on BetterClaw

(This is for non-tech people, so they can experience running AI agents for free)

reddit.com

u/ShabzSparq — 6 days ago

▲ 31 r/AIAgentsStack+4 crossposts

"BYOK sounds great but what if I don't want to pay for an API key either?"

Fair. No need to pay.

So I went and set up a completely free agent. Free platform. Free model. Free channel. Full OAuth integrations. Custom skills. Secrets management. $0 total. Not $5. Not "basically free." Zero.

Here's exactly what I did.

Step 1: Get a free API key from OpenRouter (2 minutes)

https://reddit.com/link/1t04sz5/video/a525oaa1fdyg1/player

Go to openrouter.ai. Sign up. No card needed.

You now have access to 30+ free models. Llama 3.3 70B, DeepSeek R1, Qwen 3 Coder 480B, and more. 1,000 free requests per day is more than enough for a daily agent. Most people use 10-30 per day.

Step 2: Sign up for BetterClaw free plan (2 minutes)

Go to BetterClaw App. No card. No trial. No feature gates. Takes about 2 minutes.

When it asks for your API key, paste the OpenRouter key you just generated. Select one of the free models as your default.

https://preview.redd.it/1z2xbso5fdyg1.png?width=754&format=png&auto=webp&s=823466b5a12c96381ccf112b49c581f3abd4de1b

Quick note on the free plan: it includes every feature on the platform. Not "some features." Every feature. Custom skills. Full OAuth integrations. Secrets management. Trust levels. Visual builder. We removed all feature gates recently because they were just artificial friction that cost us nothing to provide. The only limits are usage (1 agent, 100 tasks/month, daily cron).

Step 3: Connect your tools (1 minute)

This is the part that changed recently. The free plan now includes full OAuth integrations:

https://preview.redd.it/7kcsdg7cfdyg1.png?width=2378&format=png&auto=webp&s=898b9664a6988d8cff6529dc4b9fd720bd399cee

Gmail. Google Calendar. Google Drive. HubSpot. GitHub. Slack. Jira. Linear. Airtable. Asana. ClickUp. Dropbox. Figma. Canva. Calendly. LinkedIn. Mailchimp. Microsoft Teams. Intercom. Gong. Attio. Instagram. Google Ads. Google Search Console.

25+ services. One-click OAuth. Not webhook-only. Real OAuth. Your agent authenticates and acts on your behalf.

So your $0 agent can read your Gmail, qualify leads against your HubSpot data, and book meetings on Google Calendar. For free. That's not a demo. That's a working sales pipeline.

Step 4: Connect Telegram (1 minute)

Create a bot via BotFather. Paste the token. Your agent is reachable from your phone.

https://preview.redd.it/iprzxrprfdyg1.png?width=2400&format=png&auto=webp&s=cc959a279e8548cffbc586159318aba567863914

Which free model should you pick?

For most agent tasks, Llama 3.3 70B or DeepSeek R1 handle daily briefings, summarization, email triage, and basic research just fine. They're not Claude. But for a free agent doing routine tasks, they're more than good enough.

Other free options beyond OpenRouter:

Google Gemini 2.5 Flash: free tier, 1,500 requests/day
Groq: Llama 3.3 70B at 300+ tokens/sec, free tier
Cerebras: free tier, ~1,000 tokens/sec

Any of these work. Paste the key, pick the model, done.

"What's the catch with free models?"

I'll be straight with you.

They're slower than paid models. You'll notice the latency. Not unbearable, but noticeable.

Complex multi-step reasoning gets shaky. If you need your agent to do a 10-step research chain with tool calls at each step, free models stumble. Simple tasks and single-step requests are fine.

Rate limits exist. You're on shared infrastructure. During peak hours, you might get queued. Not often, but it happens.

Quality varies by model. Some days DeepSeek R1 nails it. Some days it rambles. You learn which model works for which task.

But for a "try this before spending money" setup or a "I just want a simple daily assistant" use case, free models genuinely work. I tested it and was surprised.

"What about security on a free plan?"

Same security as paid. Not "lite security." Same.

Your API keys and OAuth tokens are AES-256 encrypted and auto-purge from agent memory after 5 minutes. Your agent runs in an isolated Docker container. Trust levels mean your agent starts restricted (Intern) and earns autonomy over time. Action approval for sensitive operations. Kill switch from any device.

Your credentials are never stored longer than they need to be. Even if someone breached the container, your secrets are already gone.

"Will I hit the 100 task limit?"

With this setup? Probably not in month one. Here's my math:

1 daily briefing cron: 30 tasks/month. 1 weekly report cron: 4 tasks/month. 15-20 ad-hoc requests per week: ~70 tasks/month.

Total: roughly 100. Tight but workable if you're not running 5 daily crons. If you find yourself constantly hitting the limit, that's the signal that the free agent is useful enough to upgrade. Pro is $19/agent/month with unlimited tasks, hourly scheduling, and $5 managed LLM credits included. But month one? You'll be fine.

The whole thing takes 5 minutes:

OpenRouter signup + key generation: 2 minutes. BetterClaw free plan signup + key paste: 2 minutes. OAuth integrations + Telegram: 1 minute.

5 minutes from nothing to a working AI agent that reads your email, connects to your tools, and costs $0.

I'm not going to pretend this replaces a Sonnet-powered setup with unlimited tasks and permanent memory. It doesn't. But as a "see if this agent thing is actually useful for me" starting point, you literally cannot beat free.

Try it → BetterClaw

If you get stuck during setup or want help picking the right free model for your use case, drop it in the comments.

reddit.com

u/ShabzSparq — 14 days ago