u/DukeCabboom

I’m building a knowledge base system that needs to do RAG over internal SharePoint content. Right now I pull documents via the Graph API, chunk and vectorize them externally (Azure AI Search on our side), and keep everything in sync manually.
It works, but it’s a lot of plumbing to maintain: delta sync, permission propagation, re-indexing on updates, etc.
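To give a sense of the delta-sync part of that plumbing, here is a rough Python sketch (illustrative only, not the actual pipeline). It assumes the Graph drive delta endpoint's paging behavior: pages carry `@odata.nextLink` until the last one, which carries `@odata.deltaLink`, and removed items have a `deleted` facet. The names `drain_delta` and `fetch_page` are made up for this example; `fetch_page` stands in for whatever authenticated GET you already have.

```python
from typing import Callable, Dict, List, Tuple


def drain_delta(
    start_url: str,
    fetch_page: Callable[[str], Dict],
) -> Tuple[List[Dict], List[str], str]:
    """Walk one round of delta pages.

    Returns (files to re-chunk and re-index, item ids to drop from the
    index, delta link to persist for the next sync run).
    """
    upserts: List[Dict] = []
    deletes: List[str] = []
    url = start_url
    while True:
        page = fetch_page(url)  # hypothetical: returns the parsed JSON body
        for item in page.get("value", []):
            if "deleted" in item:
                # Item was removed: purge its chunks from the vector index.
                deletes.append(item["id"])
            elif "file" in item:
                # New or changed file: needs re-chunking and re-embedding.
                upserts.append(item)
            # Folders and other non-file items are skipped.
        if "@odata.nextLink" in page:
            url = page["@odata.nextLink"]
        else:
            # Last page: store this link and start from it next time.
            return upserts, deletes, page["@odata.deltaLink"]
```

Even this simplified version ignores permission changes, throttling, and expired delta tokens, which is where most of the maintenance effort ends up.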
I’m curious if anyone here knows of Microsoft’s plans to make this more native. Something like SharePoint handling the embedding pipeline itself, maybe surfaced through Copilot extensibility or the Syntex/Content AI stack. There’s been noise about SharePoint Agents and Microsoft 365 Copilot connectors, but I haven’t found anything concrete on a roadmap for “real” RAG support where you just point at a library and get a semantic search endpoint back.
Has anyone found a cleaner path than the pull-and-vectorize-externally approach? Or seen anything from Microsoft Ignite or Build sessions suggesting this is coming natively?

reddit.com
u/DukeCabboom — 7 days ago