u/AnalysisLogical465

I'm running an ADF pipeline that loads **PostgreSQL -> ADLS (Parquet)**. A ForEach processes 10 tables in parallel (`batchCount=10`), and each iteration runs:

  1. **Lookup** → get max watermark from source

  2. **Copy** → pull delta rows between old and new watermark

  3. **Script** → update watermark table in Azure SQL
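
For context, steps 1 and 2 are the usual high-watermark pattern. Here is a minimal sketch in Python of the delta window the Copy step pulls. This is not the actual ADF expression: `build_delta_query`, the `orders` table, and the `updated_at` column are hypothetical names, and the exclusive lower / inclusive upper bound is an assumption about how "between old and new watermark" is applied.

```python
def build_delta_query(table: str, watermark_column: str,
                      old_watermark: str, new_watermark: str) -> str:
    """Build the SELECT for one table's delta window (illustration only).

    Assumption: rows strictly after the old watermark, up to and including
    the new one, so a row is never loaded twice across runs.
    Plain string formatting is used for readability; real code should use
    parameterized queries instead.
    """
    return (
        f"SELECT * FROM {table} "
        f"WHERE {watermark_column} > '{old_watermark}' "
        f"AND {watermark_column} <= '{new_watermark}'"
    )


# Hypothetical table and column names, just to show the shape of the query.
query = build_delta_query(
    "orders", "updated_at", "2024-01-01 00:00:00", "2024-01-02 00:00:00"
)
print(query)
```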

PostgreSQL is set to `max_connections = 100`. With 10 parallel iterations each opening 2+ connections, I'm hitting the ceiling and the pipeline fails. Grafana shows clear spikes up to ~80 connections right when the pipeline runs.
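
The back-of-the-envelope math, as a rough Python sketch. The helper names and the `baseline` / `reserved` parameters are made up for illustration (connections held by other clients, and headroom kept free for admin sessions), and "2 per iteration" is only the lower bound from above, so treat the results as estimates.

```python
def peak_connections(batch_count: int, conns_per_iteration: int,
                     baseline: int = 0) -> int:
    """Estimated peak: other clients plus every parallel iteration's connections."""
    return baseline + batch_count * conns_per_iteration


def max_batch_count(max_connections: int, conns_per_iteration: int,
                    baseline: int = 0, reserved: int = 0) -> int:
    """Largest batchCount that stays within the connection budget."""
    if conns_per_iteration <= 0:
        raise ValueError("conns_per_iteration must be positive")
    available = max_connections - baseline - reserved
    # Never return a negative batch size when the budget is already exhausted.
    return max(available // conns_per_iteration, 0)


# Numbers from the post: batchCount=10, at least 2 connections per iteration.
print(peak_connections(batch_count=10, conns_per_iteration=2))      # 20
print(max_batch_count(max_connections=100, conns_per_iteration=2))  # 50
```

With `baseline=0` the pipeline alone accounts for at least 20 connections, so the gap up to the ~80 in Grafana has to come from more connections per iteration than the 2 assumed here, from other clients, or both.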

**What's the best approach?**

- Lower `batchCount`?

- Configure something on the Linked Service?

- Put a **PgBouncer** in front of Postgres?

- Something else entirely?

Is this a pipeline design problem or an infrastructure problem? Any experience with this appreciated.

reddit.com
u/AnalysisLogical465 — 14 days ago