r/dataengineering
How do you safely share production data with dev/QA teams?
I keep running into the same problem: I need to share production CSV data with dev/QA teams, but I can’t expose PII.
So far I’ve tried:
- manually masking columns
- writing small scripts
Both approaches are still tedious and error-prone, especially when relationships between fields need to be preserved.
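To make the relationship problem concrete, here is a minimal Python sketch of one possible script. It assumes keyed hashing (HMAC-SHA256) is an acceptable way to mask values: the same input always maps to the same token, so repeated values and joins across files still line up. The names `SECRET_KEY`, `pseudonymize`, and `mask_csv`, and the `email` column, are made up for illustration and do not come from any particular tool.

```python
import csv
import hashlib
import hmac
import io

# Hypothetical key for illustration; a real one would come from a secret store.
SECRET_KEY = b"replace-me"


def pseudonymize(value, key=SECRET_KEY):
    """Return a stable token: the same value and key always give the same output."""
    if not value:
        return value  # leave empty or missing cells untouched
    digest = hmac.new(key, value.encode("utf-8"), hashlib.sha256).hexdigest()
    return digest[:16]  # shortened for readability; keep more characters to lower collision risk


def mask_csv(src, dst, pii_columns, key=SECRET_KEY):
    """Copy CSV rows from src to dst, replacing the listed columns with tokens."""
    reader = csv.DictReader(src)
    writer = csv.DictWriter(dst, fieldnames=reader.fieldnames)
    writer.writeheader()
    for row in reader:
        for col in pii_columns:
            row[col] = pseudonymize(row[col], key)
        writer.writerow(row)


if __name__ == "__main__":
    # Assumed sample data: the first and third rows share an email address.
    src = io.StringIO("email,amount\na@x.com,10\nb@x.com,20\na@x.com,30\n")
    dst = io.StringIO()
    mask_csv(src, dst, ["email"])
    print(dst.getvalue())
```

Using the same key for every file keeps a given customer's token identical across files, which is what preserves the relationships. This is pseudonymization rather than full anonymization: anyone holding the key can recompute tokens for known values, and unmasked columns may still identify people.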
I’m curious how others handle this in real workflows.
Are you using internal tools, scripts, or something else?
u/Lower-Candle3471 — 21 hours ago