Looking for technical partners for initial build in AI eval service space
We’re building an independent research institution that measures whether enterprise AI product capability claims actually hold up. We design controlled evaluations, build verified ground truth datasets, run systematic benchmarks, and publish findings used by PE investors and enterprise buyers.
We’re now building the data infrastructure — automated signal aggregation pipelines, benchmark runners, a structured intelligence database. The stack is Python, FastAPI, Postgres, and LLM APIs (OpenAI, Anthropic, Exa, Reddit).
We’re looking for talent in India
Two types of people we’re looking for:
Option A — Contractual
You run a dev shop or take freelance projects. You’re good, you’re fast, and you want a well-scoped engagement. Rate card arrangements, monthly billing. Fine with this being transactional.
Option B — Mission-driven
You’re genuinely excited about AI evaluation infrastructure and want to be part of building something from scratch. Lower compensation to start, but you’d be an early team member with a path to a full-time founding engineer role as the Lab scales. Equity conversation when the time is right.
To avoid back and forth — DM me with the following, clearly structured:
If you’re Option A: your rate (monthly, in INR), 2-3 examples of similar projects you’ve shipped with links or screenshots, your availability, and one line on why this is interesting to you.
If you’re Option B: your background, LinkedIn, Github, other profiles
Generic “I’m interested, let’s chat” messages won’t get a response.