u/Total-Newt4778

i used to think direct api calls were the standard way to connect to llm, but the stability issues with single providers changed my perspective on this. Here is the reality I learned the hard way. When you hardwire your app to a single provider, you do not own your uptime. All you could do is pray their servers stay alive. i got burned too many times by sudden rate limits hitting during peak traffic, or silent api timeouts that broke our entire automation chain. i end up spending hours writing custom retry logic that barely even works.

after that, I routed everything through an llm gateway like zenmux, which made a difference. The automatic failover means if one model drops, traffic just shifts to a backup.

I'm currently in the middle of the Mexican dual citizenship process, and things are taking much longer than I expected,

At first evervthing seemed straightforward, but now I'm running into delays with updates, document reviews, and getting clear answers. What's confusing is that every person I talk to seems to give a different timeline.

So I'm curious:

How long did your process actually take from start to finish?

Were there long periods with no updates? What ended up causing delays in your

Case?

Did the consulate/location make a big difference?

I'm trying to figure out what's considered "normal" versus signs that something might actually be wrong with the application.

Would really appreciate hearing real experiences from anyone who's gone through it recently.

Why i stopped using direct API calls for production LLMs

Dual citizenship Mexico delay - is this normal or should I be worried?