u/PalpitationOk3004

Looking for technical partners for initial build in AI eval service space

We’re building an independent research institution that measures whether enterprise AI product capability claims actually hold up. We design controlled evaluations, build verified ground truth datasets, run systematic benchmarks, and publish findings used by PE investors and enterprise buyers.

We’re now building the data infrastructure — automated signal aggregation pipelines, benchmark runners, a structured intelligence database. The stack is Python, FastAPI, Postgres, and LLM APIs (OpenAI, Anthropic, Exa, Reddit).

We’re looking for talent in India

Two types of people we’re looking for:

Option A — Contractual

You run a dev shop or take freelance projects. You’re good, you’re fast, and you want a well-scoped engagement. Rate card arrangements, monthly billing. Fine with this being transactional.

Option B — Mission-driven

You’re genuinely excited about AI evaluation infrastructure and want to be part of building something from scratch. Lower compensation to start, but you’d be an early team member with a path to a full-time founding engineer role as the Lab scales. Equity conversation when the time is right.

To avoid back and forth — DM me with the following, clearly structured:

If you’re Option A: your rate (monthly, in INR), 2-3 examples of similar projects you’ve shipped with links or screenshots, your availability, and one line on why this is interesting to you.

If you’re Option B: your background, LinkedIn, Github, other profiles

Generic “I’m interested, let’s chat” messages won’t get a response.

reddit.com
u/PalpitationOk3004 — 1 day ago

[For hire]: Looking for early technical partners/ talent

We’re building an independent research institution that measures whether enterprise AI product capability claims actually hold up. We design controlled evaluations, build verified ground truth datasets, run systematic benchmarks, and publish findings used by PE investors and enterprise buyers.

We’re now building the data infrastructure — automated signal aggregation pipelines, benchmark runners, a structured intelligence database. The stack is Python, FastAPI, Postgres, and LLM APIs (OpenAI, Anthropic, Exa, Reddit).

We’re looking for talent in India

Two types of people we’re looking for:

Option A — Contractual

You run a dev shop or take freelance projects. You’re good, you’re fast, and you want a well-scoped engagement. Rate card arrangements, monthly billing. Fine with this being transactional.

Option B — Mission-driven

You’re genuinely excited about AI evaluation infrastructure and want to be part of building something from scratch. Lower compensation to start, but you’d be an early team member with a path to a full-time founding engineer role as the Lab scales. Equity conversation when the time is right.

To avoid back and forth — DM me with the following, clearly structured:

If you’re Option A: your rate (monthly, in INR), 2-3 examples of similar projects you’ve shipped with links or screenshots, your availability, and one line on why this is interesting to you.

If you’re Option B: your background, LinkedIn, Github, other profiles

Generic “I’m interested, let’s chat” messages won’t get a response.

reddit.com
u/PalpitationOk3004 — 1 day ago

Looking for technical partners / talent

We’re building an independent research institution that measures whether enterprise AI product capability claims actually hold up. We design controlled evaluations, build verified ground truth datasets, run systematic benchmarks, and publish findings used by PE investors and enterprise buyers.

We’re now building the data infrastructure — automated signal aggregation pipelines, benchmark runners, a structured intelligence database. The stack is Python, FastAPI, Postgres, and LLM APIs (OpenAI, Anthropic, Exa, Reddit).

We’re looking for talent in India

Two types of people we’re looking for:

Option A — Contractual

You run a dev shop or take freelance projects. You’re good, you’re fast, and you want a well-scoped engagement. Rate card arrangements, monthly billing. Fine with this being transactional.

Option B — Mission-driven

You’re genuinely excited about AI evaluation infrastructure and want to be part of building something from scratch. Lower compensation to start, but you’d be an early team member with a path to a full-time founding engineer role as the Lab scales. Equity conversation when the time is right.

To avoid back and forth — DM me with the following, clearly structured:

If you’re Option A: your rate (monthly, in INR), 2-3 examples of similar projects you’ve shipped with links or screenshots, your availability, and one line on why this is interesting to you.

If you’re Option B: your background, LinkedIn, Github, other profiles

Generic “I’m interested, let’s chat” messages won’t get a response.

reddit.com
u/PalpitationOk3004 — 1 day ago

Looking for initial AI/ ML Dev talent to kickstart an AI Eval service company

We’re building an independent research institution that measures whether enterprise AI product capability claims actually hold up. We design controlled evaluations, build verified ground truth datasets, run systematic benchmarks, and publish findings used by PE investors and enterprise buyers.

We’re now building the data infrastructure — automated signal aggregation pipelines, benchmark runners, a structured intelligence database. The stack is Python, FastAPI, Postgres, and LLM APIs (OpenAI, Anthropic, Exa, Reddit).

We’re looking for talent in India

Two types of people we’re looking for:

Option A — Contractual

You run a dev shop or take freelance projects. You’re good, you’re fast, and you want a well-scoped engagement. Rate card arrangements, monthly billing. Fine with this being transactional.

Option B — Mission-driven

You’re genuinely excited about AI evaluation infrastructure and want to be part of building something from scratch. Lower compensation to start, but you’d be an early team member with a path to a full-time founding engineer role as the Lab scales. Equity conversation when the time is right.

The work involves:

•	Automated data pipelines scraping and structuring signals from public sources

•	LLM evaluation runners and scoring infrastructure

•	Backend APIs and a structured intelligence database

•	Integration with evaluation tooling like Braintrust

To avoid back and forth — DM me with the following, clearly structured:

If you’re Option A: your rate (monthly, in INR), 2-3 examples of similar projects you’ve shipped with links or screenshots, your availability, and one line on why this is interesting to you.

If you’re Option B: your background, LinkedIn, Github, other profiles

Generic “I’m interested, let’s chat” messages won’t get a response.

reddit.com
u/PalpitationOk3004 — 1 day ago