u/CoderMauro2008

▲ 58 r/SillyTavernAI+1 crossposts

NVIDIA NIM is inconsistent, so I benchmarked 20+ models every hour

If you're using NVIDIA NIM, you've probably noticed it's a bit unpredictable. Latency, success rates, and even availability can vary a lot depending on the model and time of day.

So I built NIMStats to track it 📊

It benchmarks 20+ models every hour using GitHub Actions and publishes everything to a live dashboard:

  • response times (which models are actually fast)
  • throughput (tokens/sec)
  • reliability over time (which ones fail less)
  • head-to-head comparisons

🌐 https://nimstats.maurodruwel.be/
💻 https://github.com/MauroDruwel/NIMStats

Fully open-source, zero infra cost ⚡ runs on GitHub Actions + Cloudflare Pages

Might help if you're trying to figure out which NIM models are actually usable in practice.

u/CoderMauro2008 — 12 days ago