Visualization System and API and Proxy server for Agentic Runs
We are building a visualization tool for agentic loads that group the llm and tool calls (inferred from http requests and responses). This is accomplished without any instrumentation of the agentic code. Agentic code can be in any language.
The agentic task needs to be started with a rust correlator that uses http headers to group together llm calls. We have an API server and a reverse proxy that forwards llm calls to providers.
The front end gives the usual statistics like input/output tokens, cost, model latency, latency of proxy (minimal in microseconds as servers implemented in rust).
We would appreciate comments from people who are in AI ops who use tools like litellm and Helicone and can provide some input on complicated use cases.
Next step is to normalise the JSON body from one provider to another to make agentic code provider agnostic.
We are looking for collaborators too as we are making this open source.