
Qwen 3.6 and 3.5 (even 9b) are great models for local deep research
In the past we run some benchmarks for LDR and they seem to provide really competitive performance: https://huggingface.co/datasets/local-deep-research/ldr-benchmarks
(Disclaimer: I am the maintainer of local deep research)
Also note that these are self reported benchmarks and for example oss has very few examples. I am quite compute limited so the data is generated rather slowly and mostly for smaller local models.
u/ComplexIt — 3 days ago