u/anirbanm1991

▲ 2 r/LLM

Build a agent to interact with Parquet

I have a Loan Portfolio for 8 quarters in Parquet form. It consists of information like Zone, Region, Branch, Account Name, Account No, Asset Status (SMA0/SMA1/SMA2/NPA/Regular), Account Open Date, Outstanding, NPA Date.

I want to build an agent that will interact with this parquet file and answer the query of the user. The queries can be like:

Give me the bank's portfolio outstanding for a given quarter

Which Zone/Region has the highest/lowest Portfolio outstanding/NPA/SMA?

Which Zone is an outlier in terms of NPA

Which Zone is showing an increasing trend of delinquency. Which branch is contributing for the same.

Provide a Line and Bar chart showing the portfolio growth across quarters and the number of accounts.

Visually show the share of each zone in a given quarter ?

These are just sample questions. It can be anything pertaining to the parquet.

I tried building one using Ollama with help from chatgpt but it's not getting anywhere.

It's very unstable, gives the wrong output and as per chatgpt I need to hardcode all the metrics, levels, semantics, filters, etc. it feels as if I am working on excel.

Can anyone guide me on what the approach should be?

reddit.com
u/anirbanm1991 — 3 days ago