u/FindingOk1094

▲ 1 r/LLMDevs

I'm currently using gpt-4o-mini as the model for my openai api in my project. Even getting a response from a short prompt such as "What is your name?" takes 5-10 seconds. How do I reduce the latency, and optimise my project?

reddit.com

u/FindingOk1094 — 10 days ago