u/SabrinaScissorhands

We built a data scraper that pulls publicly available information. One issue we keep running into is CAPTCHA/rate-limit protection.

We currently use CAPTCHA-solving software, but it only works for about 10 requests before the site starts blocking or rejecting requests again.

Would rotating proxies or VPNs help with this? Are there different types that people use for scraping setups? We'd also appreciate any advice on making the scraper more stable and on better practices for scraping public sites without constantly triggering anti-bot protections.

Question about captchas