u/sibraan_ — reddlx

Google built a working OS from scratch using AI agents for under $1,000 in API credits. It took 93 subagents, 12 hours, 15K model requests, 2.6B tokens...

u/sibraan_ — 16 hours ago

Octoparse MCP Challenge — $1,800 in prizes

Octoparse just launched their MCP Server, it connects Claude, ChatGPT, and Cursor directly to web scraping through plain English. You say what you want, it pulls the data. No code needed.

And they're running a $1,800 build challenge right now:

Track 1 — $600: Build any AI scraping workflow using Octoparse MCP
Track 2 — $800: Build an advanced workflow or AI Agent using OTD + MCP
Most Creative Build ($400): Picked from Track 2 automatically.

Everyone who participates gets 5,000 free data lines + 2,000 free lines every week.

Exclusive r/AgentsOfAI code: OCTOMCP-R8P4
This is a special code for Free and Basic users, and can be used to claim extra credits after connecting MCP on your platform. Reply with your code to redeem.

📅 Challenge runs: May 12 – June 15 | Winners announced: June 30

Challenge brief + submission form in the comments below 👇

u/sibraan_ — 3 days ago

🔥 Hot ▲ 7.3k r/AgentsOfAI+3 crossposts

Biggest AI fumble in tech

u/21734234 — 4 days ago

▲ 7 r/AgentsOfAI+3 crossposts

Octoparse MCP Challenge ($1,800 in Prizes): Build AI Web Scraping Workflows (No Code)

Hey guys,
Octoparse just launched Octoparse MCP Server, it connects Claude, ChatGPT, Cursor, and other AI clients directly to web scraping through natural language. You say "get me the top 100 coffee shops in NYC from Google Maps" and it just... does it.
No code, no manual setup.

To celebrate, they're running a build challenge with $1,800 in prizes:

Track 1 ($600): Build an AI web scraping workflow using Octoparse MCP. You can combine it with other MCP tools too. Real use case, show it working.

Track 2 ($800): Use Octoparse OTD + MCP to build an advanced workflow or AI Agent.

Most Creative Build ($400): Picked from Track 2 automatically.

How it works:

Build something with Octoparse MCP Server
Publish your work anywhere (article, video, GitHub, etc.)
Submit via the form: [Google Form Link]

(OPTIONAL) Share on X with #BuildWithOctoparse and @ Octoparse, automatically entered into the $200 X Prize Draw.

Everyone gets 5,000 free data lines (no expiry) + 2,000 free data lines every week just for participating.

Exclusive code for the community: OCTOMCP-R8P4
This is a special code for Free and Basic users, and can be used to claim extra credits.
How to redeem: after successfully connecting the MCP on your AI platform, reply with your coupon code to claim the reward.

Challenge period: May 12 – June 15, 2026 | Winners announced: June 30, 2026
Challenge brief and Submission form are in the comments below.

reddit.com

u/nitkjh — 6 days ago

▲ 53 r/learnmachinelearning

The reason your enterprise RAG pipeline degrades over time (it's not the model)

Spent the last few months debugging production AI systems for a handful of mid-to-large orgs, and I keep seeing the same failure pattern that nobody really talks about in the benchmarking literature.

The model isn't the problem. The retrieval isn't even really the problem. The problem is document heterogeneity rot.

Here's what I mean. When you first stand up a RAG system, your corpus is relatively clean. You've chunked it, embedded it, indexed it. The retrieval scores look great in eval. Then six months pass.

Now you have:

A 2023 policy doc that was superseded by a 2024 amendment that lives in a completely different folder
Meeting transcripts that reference decisions that were later reversed via email (which is not indexed)
Contracts with line-item exceptions that got negotiated verbally and exist only in someone's Outlook

Your retrieval system has no concept of document authority hierarchy. It treats a deprecated policy PDF the same as the current one because cosine similarity doesn't care about org chart logic or recency signals beyond naive metadata.

The fix isn't better chunking or a bigger embedding model. It's building provenance chains into your indexing architecture from the start so the system knows not just what a document says, but whether it's still true.

A few teams I've seen handle this well (firms like 60x working in the enterprise space, some internal teams at larger consultancies) are essentially building a lightweight governance layer that sits between ingestion and retrieval tagging documents with confidence decay rates and authority signals rather than treating the corpus as a flat library.

It's more engineering overhead upfront. But it's the only thing that actually keeps production accuracy from drifting.

reddit.com

u/sibraan_ — 9 days ago

▲ 0 r/Rag

I’ve spent the last few months talking to Ops leaders who are frustrated with the one-step-forward, two-steps-back nature of their AI implementations.

The story is always the same: They build a custom GPT or a standard RAG (Retrieval-Augmented Generation) system, feed it their SharePoint/confluence, and it works great for about a week. Then, the hallucinations start. The AI forgets a policy update from last Tuesday or mixes up a 2019 contract with a 2024 renewal.

The problem isn't the LLM. It’s that your AI doesn't have a Context Graph.

Most people assume a Knowledge Graph is enough. A Knowledge Graph is great at saying "Entity A is related to Entity B" (e.g., Paris is the capital of France). It’s a static map.

But in a high-stakes business, facts aren't static. They have temporal traces and causal edges.

A Context Graph (what we’re seeing firms like 60x and others move toward) doesn't just record what is connected; it records the how, when, and why.

Temporal Context: It knows that a 30% discount approved by a VP on Friday was a one-off exception for a specific client, not a new company-wide policy.
Decision Traces: It maps the lineage of a decision. When the AI gives an answer, it’s not just pulling a text chunk; it’s traversing a graph of past approvals and meeting outcomes.

If you’re building AI for a business that relies on institutional memory, you have to stop treating your data like a giant library and start treating it like a living network of decisions.

reddit.com

u/sibraan_ — 15 days ago

▲ 9 r/twin

u/sibraan_ — 27 days ago