r/AZURE
Securely integrating external open-source legal data (Git repo + MCP server) into Azure cloud — best practices?
Hey everyone,
I’m relatively new to this side of things and could use some war stories.
I'm building an LLM hub on Azure and need to integrate two open-source legal data
projects:
- GitHub: full archive of laws as Markdown files in Git
- MCP server wrapping the official law API for real-time legal queries from LLMs
Problem is, our firm’s security policy doesn't allow direct external connections
(no direct Git clone, no direct outbound API calls from production).
Looking for advice on:
- Secure integration patterns: Git mirroring through DMZ? APIM as API gateway
for external calls? Any battle-tested approaches?
- Additional Azure resources needed: beyond APIM, Firewall, Key Vault,
what else should I plan for?
- Rough effort estimate: anyone done something similar? How many man-days
did it take including infra + pipeline?
For context: planning to chunk the legal data → embed → Azure AI Search for RAG,
with incremental sync when laws are amended.
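To make the incremental sync part concrete, here is a minimal sketch of one way it could work, assuming the Git mirror is already checked out on a local path inside the network. The function names and the manifest format (relative path mapped to SHA-256) are hypothetical, and the embedding and Azure AI Search upload steps are left out on purpose.

```python
import hashlib
from pathlib import Path


def build_manifest(root: Path) -> dict[str, str]:
    """Map each Markdown file (relative POSIX path) to the SHA-256 of its bytes."""
    return {
        p.relative_to(root).as_posix(): hashlib.sha256(p.read_bytes()).hexdigest()
        for p in sorted(root.rglob("*.md"))
    }


def diff_manifest(old: dict[str, str], new: dict[str, str]) -> tuple[list[str], list[str]]:
    """Return (files to re-chunk and re-embed, files to delete from the index)."""
    changed = sorted(p for p, digest in new.items() if old.get(p) != digest)
    removed = sorted(p for p in old if p not in new)
    return changed, removed


def chunk_markdown(text: str, max_chars: int = 2000) -> list[str]:
    """Split at Markdown headings, then cap each section at max_chars characters."""
    sections: list[str] = []
    current: list[str] = []
    for line in text.splitlines():
        # A heading line starts a new section (assumption: one law article per heading).
        if line.startswith("#") and current:
            sections.append("\n".join(current))
            current = []
        current.append(line)
    if current:
        sections.append("\n".join(current))

    chunks: list[str] = []
    for section in sections:
        for start in range(0, len(section), max_chars):
            chunks.append(section[start:start + max_chars])
    return chunks
```

The idea would be to store the manifest from the previous run, rebuild it after each mirror update, and only re-embed the files in the first list while deleting index entries for the second. Comparing commit SHAs with `git diff --name-status` would be an alternative to hashing file contents.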
Any experience sharing would be hugely appreciated. Thanks!
u/Great-Perception-714 — 15 hours ago