▲ 1 r/LLMDevs
I'm currently using gpt-4o-mini as the model for my openai api in my project. Even getting a response from a short prompt such as "What is your name?" takes 5-10 seconds. How do I reduce the latency, and optimise my project?
u/FindingOk1094 — 10 days ago