u/Low_Marionberry3072

Hi there, I am currently working on extracting and structuring scanned financial business plans via LLMs, I am using Qwen for data OCR extraction and it really works but I am suffering with organizing my data cause my pdfs can be in multiple schemas which need a lot of reasoning I ve tried many LLMs like deepseek mistral... way far from desired result.

Constraint: only open source models are valid

reddit.com
u/Low_Marionberry3072 — 18 days ago