Description
Why are we doing this?
Voice is a natural way to interact with AI. By adding real-time voice to gpt-rag, we make retrieval-augmented assistants more engaging, accessible, and useful in scenarios like meetings, customer support, and live collaboration where hands-free or multilingual interaction is essential.
What does it do?
- Voice-enabled RAG – Adds "speech in, speech out" to gpt-rag, letting users query enterprise knowledge by voice and receive spoken, retrieval-grounded responses.
- Phone Integration – Lets users call a phone number to interact with the assistant, and supports assistant-initiated outbound calls.
- Realtime reasoning – Uses the Azure OpenAI GPT Realtime API for low-latency transcription, retrieval, and response synthesis over enterprise data sources.
- Use cases – Meeting assistants, customer service bots, live Q&A in Teams, and multilingual knowledge agents.
- Nice to have: Teams integration – Lets VoiceRAG join Microsoft Teams calls, capture live audio queries, and provide contextual answers in real time.
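The realtime flow above could be wired up by configuring a Realtime API session that exposes retrieval as a callable tool. A minimal sketch follows, assuming the app sends a `session.update` event over the realtime connection; the tool name `search_knowledge_base` and its schema are illustrative assumptions, not an existing gpt-rag interface.

```python
import json


def build_session_update(instructions: str) -> dict:
    """Build a `session.update` event for a GPT Realtime API session.

    The retrieval tool below (`search_knowledge_base`) is a hypothetical
    function that the server-side app would implement against the gpt-rag
    retrieval layer; its name and schema are assumptions for illustration.
    """
    return {
        "type": "session.update",
        "session": {
            # Speech in, speech out: accept and produce both audio and text.
            "modalities": ["audio", "text"],
            "instructions": instructions,
            # Let the service detect the end of user speech (server-side VAD).
            "turn_detection": {"type": "server_vad"},
            # Expose retrieval as a tool the model can call mid-conversation,
            # so spoken answers stay grounded in enterprise content.
            "tools": [
                {
                    "type": "function",
                    "name": "search_knowledge_base",
                    "description": "Search enterprise documents and return grounded passages.",
                    "parameters": {
                        "type": "object",
                        "properties": {"query": {"type": "string"}},
                        "required": ["query"],
                    },
                }
            ],
        },
    }


event = build_session_update("Answer only from retrieved enterprise content.")
payload = json.dumps(event)  # sent as a text frame over the realtime connection
```

When the model emits a tool call for `search_knowledge_base`, the app would run the existing gpt-rag retrieval pipeline and stream the result back as a tool response, after which the model synthesizes the spoken answer.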
Technical Guidelines
High Level Solution Architecture
References
- Related IP:
Other
- https://techcommunity.microsoft.com/blog/azure-ai-foundry-blog/upgrade-your-voice-agent-with-azure-ai-voice-live-api/4458247
- Integrate Microsoft Teams real-time media bots via Graph Cloud Communications API or Bot Framework.
- https://learn.microsoft.com/en-us/microsoftteams/platform/bots/calls-and-meetings/calls-meetings-bots-overview
- https://learn.microsoft.com/en-us/azure/ai-foundry/openai/how-to/realtime-audio-webrtc