r/AskVibecoders

▲ 153 r/AskVibecoders+3 crossposts

How I cut Claude Code token usage in half (open source, benchmark included)

Been working on Repowise for a few months now. The core idea: AI coding agents are only as good as the context they get. Most of the time, that context is terrible.

Cursor reads your files. It doesn't know your architecture. It doesn't know which files break the most. It doesn't know why you made that weird design decision in auth six months ago.

So I built a layer that sits between the codebase and the agent.

Four things it does:

  1. Parses your AST into a dependency graph (NetworkX). Agents can reason about structure.

  2. Mines git history into hotspot and ownership maps. Who wrote what, what breaks most.

  3. Generates an LLM wiki of your codebase and stores it in a vector DB. Always in sync.

  4. Captures architectural decisions as ADRs so agents have intent context, not just code.

Exposes 8 MCP tools. Works with any MCP-compatible agent. Also has a local web UI to explore the graph and docs yourself.

AGPL + commercial dual license. Self-hostable.

Got a few hundred GitHub stars pretty fast. Then someone cloned it on PyPI three times in a week violating the license, had to file a DMCA. Wild week.

Happy to answer questions on the technical side or the distribution side. Both have been interesting.

Repo: https://github.com/repowise-dev/repowise

Dogfooding on website: https://repowise.dev

A star would really help with visibility!

u/Obvious_Gap_5768 — 13 hours ago
▲ 687 r/AskVibecoders+2 crossposts

Last week, I read about how vibe coders were burning 100 million tokens for just a few dollars in research, and I wrote an article about it.

So basically, I did deep technical research into the tools and methods people use for this (basically anyone can replicate it), how the process works, and how it’s also being used for training smaller models and in the process they make million dollars.

here is the deep research over it if anyone is interested

https://x.com/HarshalsinghCN/status/2056626175959826692?s=20

let me know your views about this, also this is long article not for doomscrollers

u/Which_Pitch1288 — 1 day ago
▲ 1 r/AskVibecoders+1 crossposts

Managing context limits in large C++20 Module codebases with MCP (Case Study & Tool)

Hi everyone,

Working with LLMs on modern C++ codebases usually hits a wall very quickly: context windows get flooded with massive files, and most standard indexers still struggle with C++20 Module partitions and imports.

We are currently running a live development workflow on a large-scale commercial project consisting of over 7,000 source files, mostly utilizing C++20 modules.

We managed to establish a highly performant workflow using the Codex App on the desktop, combined with VS MCP, IDAP MCP, and a dedicated lightweight tool we created to bridge the C++ gap: mcp-cpp-project-indexer.

The Problem We Solved

Standard file-dumping or naive regex indexing either sends thousands of lines of irrelevant code to the LLM (costly and slow) or completely loses track of C++20 module dependencies.

Instead of trying to replace a full compiler/LSP (like clangd) or performing heavy semantic analysis, our indexer acts purely as a stream- and token-based locator. It maps out files, symbols, and module structures, providing the LLM with exact line references (startLine/endLine).

The Setup & Results

  • The Stack: Codex App + VS MCP + IDAP MCP + mcp-cpp-project-indexer.
  • Token Reduction: The LLM only requests and reads the exact code fragments it actually needs. This reduces the text sent to the LLM by up to 86%.
  • Performance: Written in Python, it includes a file watcher mode that calculates hashes incrementally. It stays up-to-date in real-time during active development without hammering the CPU.
  • Intelligence: Code/ChatGPT confirmed that the context routing works flawlessly even at this 7,000-file scale.

Why share this?

When we started, we couldn't find a lightweight, production-ready way to make Claude/GPT understand a massive C++20 module graph without spending a fortune on API tokens or waiting ages for context processing. This setup proved that the Model Context Protocol (MCP) is absolutely ready for large enterprise codebases if decoupled correctly.

The project is fully open-source. If you are struggling with C++ context limits or modules in your AI workflow, feel free to check it out, spin it up, or contribute:

👉 GitHub: github.com

I’m happy to answer any questions about how we configured the MCP synergy or how the incremental indexing handles the C++20 module tree!

u/Narrow_Cartoonist937 — 19 hours ago
▲ 62 r/AskVibecoders+3 crossposts

I replicated Anthropic's Generator-Evaluator harness to build a website through 12 adversarial AI iterations - here's the result and what I learned

Anthropic recently published their harness design for long-running apps - a multi-agent architecture inspired by GANs where a Generator builds code and an Evaluator critiques it in a loop.

I built my own version using Kiro CLI and used it to generate a marketing website for my project Mnemo (persistent memory for AI coding agents).

The architecture:
Planner (runs once) → Generator ↔ Evaluator (12 iterations)
Each agent is a separate CLI process with zero shared context. They communicate only through files (spec.md, eval-report.md). The Evaluator uses Playwright to actually browse the live site - not just read code.

What made it work:

Clean slate per invocation — each agent starts fresh, reads only its input files. Prevents context anxiety.

Playwright MCP for testing — the evaluator navigates, clicks, resizes viewports. Catches visual bugs code review never would.

Anthropic's frontend design skill — explicitly penalizes generic AI patterns (Inter font, purple gradients, card layouts). Forces creative risk-taking.

Continuous iteration, not retry-on-failure— all 12 rounds run regardless. Each one improves.

The progression was wild:
Iteration 1: Exactly what you'd expect from AI - functional but forgettable

Iteration 4: Generator pivoted to "Terminal Noir"
IBM Plex Mono, amber on black, grain textures, scanlines. This is the kind of creative leap that doesn't happen in single-shot generation.

Iterations 5-12: Polish, accessibility, responsive fixes, reduced-motion support

Stats:
Total time: 3h 20min
Iterations: 12 (generator + evaluator each)
Manual code written: 0 lines
Tech: Next.js, Tailwind, Framer Motion, TypeScript

Live result: https://mnemo-mcp.github.io/Mnemo/

Key takeaway: The model is the engine. The harness, the constraints, feedback loops, and adversarial structure around it, what determines whether you get AI slop or something genuinely distinctive.

Happy to answer questions about the architecture or share the prompt files.

u/killerexelon — 3 days ago

The hardest part of vibe coding no one talks about

I’ve been working on a single app for 10 months. Vibe coded the whole thing. Never looked at a line of code. Added quality assurance tests to catch any changes I wasn’t expecting. Should have done that earlier. Ran through code reviews of memory leaks, race conditions and uncaught errors. Performance tested it. Rewrote sections to be performant. Tests for all functionality. Administration interfaces to manage the application. Automated backups and email server. Built a scraper and went through iterations of scraping engines. Built data cleansing and data normalization functions. Ran penetration testing. But, the hardest thing I’m doing by far is operationalizing it. You can try and close all the holes you can think of but it’s the unexpected that ties you up. A random crash. An edge condition. System level functions that don’t consistently function as expected. What’s funny to me is I thought this part was going to be the easy part.

reddit.com
u/CreamPitiful4295 — 2 days ago
▲ 6 r/AskVibecoders+3 crossposts

I built extension which exports base44 lovable bolt rork projects in zip

Built a small Chrome extension called VibeApp Exporter because I got tired of manually rebuilding projects from AI app builders 😅

I was using tools like Lovable, Bolt, Base44 and Rork on their free plans. The annoying part was that exporting projects properly usually wasn’t available for free users.

So every time I wanted to continue the project locally in VS Code, I had to:

  • manually create folders
  • create nested subfolders
  • copy each file one by one
  • paste code manually
  • rebuild the whole structure myself

It became super time consuming once projects started getting bigger.

So I built an extension that:

  • detects the project tree
  • opens nested folders automatically
  • extracts files + code
  • preserves exact folder structure
  • exports everything into a ZIP

It currently supports:

  • Lovable
  • Bolt
  • Base44
  • Rork

Everything runs browser-side and the code isn’t uploaded anywhere.

Honestly started as a personal pain-point project but figured other vibe coders probably deal with the same thing.

Still improving extraction reliability for different builders and edge cases, but it’s already saving me a lot of time.

Install for Free: VibeApp Exporter

u/MudasirItoo — 3 days ago

Looking for Claude Code Skill Files for Better UI Design & SEO Tasks

Hey everyone,

I'm a frontend developer and currently using Claude Code for development tasks. The issue is that most of the generated Ul/designs feel very basic and generic.

I'm looking for good skill files or methods that can help it build better Ul designs.

I also want skill files for SEO improvement tasks.

So, if anyone has good skill files, GitHub repos, or methods for frontend development tasks and website SEO tasks, please suggest them.

reddit.com
u/movies1804 — 3 days ago

What tools are there for visualising an agentic session?

Software engineer for 13 years. Been agentic coding for about three weeks.

Something I'm finding a bit frustrating is:

  1. I can assign Claude a github issue and it'll go off of create a PR, but then I see that like half of my token allocation is used. I want to be able be able to see the cost of that session.

  2. If I'm using Claude Code in my IDE/terminal I can watching the reasoning occur and intervene if needed, though even that gets more difficult when spawning subagents.

What I really want to be able to do is a see a graph of exactly what happened, when sub agents got spawned, what context they then provided back into their spawning agent etc.

The idea then is that I can use this to optimise my prompts.

I feel like this must fairly well trodden territory - any suggestions for tooling?

Thinking to the Github driven workflow -imagine some kind of HTML page attached to the pull request that lets you explore it.

reddit.com
u/davidblacksheep — 2 days ago

How would you build a conversational control layer for client/brand workflows?

I’m building a client workflow dashboard and need architecture advice.

I’m trying to keep the dashboard/database as the source of truth and use the bot as a command layer, but I’m not sure if that’s the right pattern.

Each client would have workspaces, agents/workflows, run logs, outputs, analytics, and approvals.

I want a conversational control layer where I can type things like:

* “Run monthly report for Client A”

* “Show failed workflows”

* “Add SEO workflow to Client B”

* “Create a GitHub PR for this agent config”

* “Summarize this week’s outputs”

I’m debating where the bot should live:

* Slack bot

* Telegram bot

* chat panel inside the dashboard

* combination of the above

Stack I’m considering:

* Vercel dashboard

* Railway API/orchestrator

* Postgres

* GitHub for configs/code changes

* LLM API

* background job queue

Main question:

What’s the cleanest way to connect a conversational bot to deployed workflows?

Should the bot call APIs directly, create queued jobs, trigger GitHub workflows, or only create approval requests that the dashboard executes?

I’m especially interested in permissioning, audit logs, human approval steps, and avoiding accidental production changes.

reddit.com
u/SeNorMat — 2 days ago

how much do you spend on ai coding?

I have been speaking to my dev friends and answers vary WILDLY

A few of them say that they can survive on the free tiers, most say that they spend between $20-$150 but I have had some absolutlely insane answer. One guy told me that him and his only other technical teammate at his company blew through $65k last month!!!

So, how much are you burning through?

reddit.com
u/Complete-Sea6655 — 4 days ago
▲ 1 r/AskVibecoders+1 crossposts

I built an SSH client with a built-in AI agent after getting tired of switching between terminal and ChatGPT — here's what I learned

Disclaimer: I'm the developer of this tool.

Been working in network engineering for years. The constant context-switching between SSH sessions and AI tools was killing my workflow — copy output, paste to ChatGPT, get a suggestion, go back to terminal, repeat.

So I built something that keeps the AI inside the terminal itself. Three modes: Read Only (safe inspection), Fix Mode (approve each command), Auto Pilot (autonomous). Works through jump hosts, auto-detects vendor syntax, never sends your configs to our servers.

Still early — would love feedback from people who actually live in terminals. What's missing? What would make you switch from your current client?

netcopilot.app (free download, open source)

u/anasabdullkarim — 4 days ago
▲ 1 r/AskVibecoders+1 crossposts

I now have 43 submissions on my web app!

Post your project here 👇

https://shipitorskip.com

I created a web app like facemash but for vibe/indie projects

Been seeing a ton of people sharing their builds across Reddit and Twitter for thoughts on vibe coded and indie built projects.

It’s basically a head to head format where two projects show up and you just pick which one you’d ship or skip. There's a live leaderboard, and you can look projects up by categories too.

Trying to keep it fast and easy to use, I just added a small “why I built this” section so you can understand the intent behind a project before voting.

Still early and a v1ish.

If you’ve built anything you can submit it!

Would love some feedback.

reddit.com
u/StardustOfEarth — 3 days ago

most vibe coders ship to nobody. here's why.

been building solo for 2 years. shipped 11 things. zero of them made meaningful money. not "indie hacker lottery" zero, actual zero.

took a step back to figure out why and the pattern was the same on every one:

  • built first, asked who'd pay after
  • "validated" in twitter polls and reddit threads where free advice flows and money doesn't
  • shipped MVP to 30 friends who said "nice"
  • watched signups stall at 12 a week, called it a marketing problem

the marketing problem wasn't marketing. i was building things people would tolerate, not pay for. that's a different thing and free reddit feedback won't surface it.

what actually changed:

before writing a line of code on project 12, i posted a paid task asking 50 people in my niche to spend a week thinking about whether they'd pay for a tool that did X. the bounty was $5 per detailed response. spent like $250 total.

of 47 responses, 11 said they'd pay $20/mo for the v1 spec. 4 said they'd pay $50. the other 32 either said "i already use [tool]" (named existing solutions, useful) or "i don't have this problem". better data than 6 months of "this is cool" comments.

paid feedback beats free feedback because the person answering is committed. they read the brief, they write a real answer, they tell you which tools they already use. you can sort the data.

used Pond for it. people you've never heard of submit, you only pay the useful ones.

stopped building things nobody asked for. building things 11+ people have committed to via paid signal.

if you've been shipping into the void, the answer isn't more ads. it's better signal.

u/FlyComfortable3169 — 4 days ago