open-source AI evaluation platform
**he problem I kept seeing:**
Companies are deploying AI agents into healthcare, legal, and finance. Their testing process is one developer asking it a few questions and saying "looks good."
The people who actually know what a correct answer looks like — doctors, lawyers, compliance officers — have zero tools they can use. Everything in the eval space requires Python, CLI setup, or JSON configs. Completely inaccessible to domain experts.
**What I built:**
EvalDesk — open source, self-hostable, no-code AI evaluation.
The workflow is three steps:
Designed specifically so a doctor or lawyer can use it without an engineer in the room. Self-hostable so sensitive data never leaves your infrastructure — critical for HIPAA and legal contexts.
**Current features:**
**What I'm looking for:**
Honest feedback. Is this solving a real problem or am I wrong about the gap? Anyone working in AI deployment in regulated industries — does this workflow actually match how your team operates?
GitHub: [https://github.com/ramandagar/EvalDesk\](https://github.com/ramandagar/EvalDesk)