u/Sad_Task604

Issues with German / Swiss German transcription in voice agent (missed words + delay)

Hey,

I’m building a voice agent with Vapi using German + Swiss German, and running into a few issues:

  • Audio works fine
  • STT misses simple words (even “hello”)
  • Dialects/accents make it worse
  • Sometimes the agent doesn’t respond at all
  • There’s also noticeable delay

It feels like the model choice, the language config, or the VAD settings are off.

Questions:

  • Best STT model for German + Swiss German?
  • Better to force de-DE or use auto-detect?
  • Any tips for handling dialects reliably?
  • How do you reduce latency in these setups?

Would love to hear what worked for others 🙏

u/Sad_Task604 — 1 month ago