Build an agent to interact with Parquet
I have a loan portfolio covering 8 quarters in Parquet format. It contains fields such as Zone, Region, Branch, Account Name, Account No, Asset Status (SMA0/SMA1/SMA2/NPA/Regular), Account Open Date, Outstanding, and NPA Date.
I want to build an agent that interacts with this Parquet file and answers user queries. The queries can be like:
Give me the bank's portfolio outstanding for a given quarter.
Which Zone/Region has the highest/lowest portfolio outstanding/NPA/SMA?
Which Zone is an outlier in terms of NPA?
Which Zone is showing an increasing trend of delinquency, and which Branch is contributing to it?
Provide a line and bar chart showing the portfolio growth across quarters and the number of accounts.
Visually show the share of each Zone in a given quarter.
These are just sample questions. A query can be about anything pertaining to the Parquet data.
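For concreteness, the first two sample questions reduce to a filter plus a group-by. Below is a minimal pandas sketch of what the agent would ultimately need to compute. It assumes a `Quarter` column (not in the field list above, so a guess), uses made-up toy rows in place of the real file, and the helper names `portfolio_outstanding` and `zone_with_highest_npa` are hypothetical.

```python
import pandas as pd

# Toy rows standing in for the real file; all values are made up for illustration.
# With the real data this would be loaded with something like
# pd.read_parquet("loan_portfolio.parquet")  (hypothetical path).
df = pd.DataFrame(
    {
        "Quarter": ["Q1", "Q1", "Q1", "Q2", "Q2", "Q2", "Q2"],  # assumed column
        "Zone": ["North", "South", "South", "North", "North", "South", "South"],
        "Asset Status": ["Regular", "NPA", "SMA1", "Regular", "NPA", "NPA", "SMA1"],
        "Outstanding": [100.0, 40.0, 60.0, 120.0, 30.0, 50.0, 70.0],
    }
)


def portfolio_outstanding(frame, quarter):
    """Total outstanding across all accounts in one quarter."""
    return frame.loc[frame["Quarter"] == quarter, "Outstanding"].sum()


def zone_with_highest_npa(frame, quarter):
    """Zone with the largest NPA outstanding in one quarter."""
    is_npa = (frame["Quarter"] == quarter) & (frame["Asset Status"] == "NPA")
    return frame[is_npa].groupby("Zone")["Outstanding"].sum().idxmax()


total_q2 = portfolio_outstanding(df, "Q2")
top_npa_zone_q2 = zone_with_highest_npa(df, "Q2")
print(total_q2, top_npa_zone_q2)
```

The computation itself is short; the difficulty is getting a model to map a free-form question onto the right columns, filters, and aggregation reliably.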
I tried building one using Ollama with help from ChatGPT, but it's not getting anywhere.
It's very unstable and gives wrong output, and according to ChatGPT I need to hardcode all the metrics, levels, semantics, filters, etc. It feels as if I am working in Excel.
Can anyone guide me on what the approach should be?