Loading...

How Azure AI Search powers RAG in ChatGPT and global scale apps

How Azure AI Search powers RAG in ChatGPT and global scale apps

Millions of people use Azure AI Search every day without knowing it. You can enable your apps with the same search that enables retrieval-augmented generation (RAG) capabilities when you build Custom GPTs or attach files in your ChatGPT prompts.

Pablo Castro, Microsoft CVP and Distinguished Engineer Azure AI Search, joins Jeremy Chapman to share how with Azure AI Search, you can create custom applications that retrieve the most relevant information quickly and accurately, even from billions of records.

Manage massive-scale datasets while maintaining high-quality search results with ultra-compact, binary quantized vector search indexes that use Matryoshka Representation Learning (MRL) and oversampling to equal the search accuracy of vector indexes up to 96 times larger. These approaches drive significant cost savings by optimizing your vector indexes without compromising quality.

► QUICK LINKS:
00:00 - RAG powered by Azure AI Search
00:50 - Azure AI Search role in ChatGPT
02:01
- Azure AI Search use case - AT&T
03:27 - Start in Azure Portal
04:35 - Massive scale and vector index
06:08 - Scalar & Binary Quantization
07:21 - Martyoshka technique
09:07 - Oversampling
11:31
- How to build an app using Azure AI Search
13:00 - See it in action
14:28 - Enable binary quantization with oversampling
14:54
- Wrap up

 

► Link References

Get sample code on GitHub at https://aka.ms/SearchQuantizationSample 

Check out search solutions at https://aka.ms/AzureAISearch

 

► Unfamiliar with Microsoft Mechanics? 

As Microsoft's official video series for IT, you can watch and share valuable content and demos of current and upcoming tech from the people who build it at Microsoft.

• Subscribe to our YouTube: https://www.youtube.com/c/MicrosoftMechanicsSeries

• Talk with other IT Pros, join us on the Microsoft Tech Community: https://techcommunity.microsoft.com/t5/microsoft-mechanics-blog/bg-p/MicrosoftMechanicsBlog

• Watch or listen from anywhere, subscribe to our podcast: https://microsoftmechanics.libsyn.com/podcast

 

► Keep getting this insider knowledge, join us on social:

• Follow us on Twitter: https://twitter.com/MSFTMechanics 

• Share knowledge on LinkedIn: https://www.linkedin.com/company/microsoft-mechanics/

• Enjoy us on Instagram: https://www.instagram.com/msftmechanics/

• Loosen up with us on TikTok: https://www.tiktok.com/@msftmechanics

Published on:

Learn more
Microsoft Mechanics Podcast
Microsoft Mechanics Podcast

Made for tech enthusiasts and IT professionals. Expanded coverage of your favorite technologies across Microsoft; including Office, Azure, Windows and Data Platforms. We'll even bring you broader topics such as device innovation with Surface, machine learning, and predictive analytics.

Share post:

Related posts

Episode 397 – Local LLMs: Why Every Microsoft 365 & Azure Pro Should Explore Them

Welcome to Episode 397 of the Microsoft Cloud IT Pro Podcast. In this episode, Scott and Ben dive into the world of local LLMs—large language ...

5 hours ago

Integrating Azure OpenAI models in your Projects: A Comprehensive Guide

In the previous blog, we have explored how to install and configure Azure OpenAI Service, now we will be unlocking its potential further by in...

11 hours ago

Effortless Scaling: Autoscale goes GA on vCore-based Azure Cosmos DB for MongoDB

We’re thrilled to announce that Autoscale is now generally available (GA) for vCore-based Azure Cosmos DB for MongoDB! Say goodbye to manual s...

1 day ago

Making MongoDB workloads more affordable with M10/M20 tiers in vCore-based Azure Cosmos DB

vCore based Azure Cosmos DB for MongoDB is expanding its offerings with the new cost-effective M10 and M20 tiers for vCore-based deployments. ...

1 day ago

Replacing jackson-databind with azure-json and azure-xml

This blog post explains how azure-json and azure-xml replaced jackson-databind in the Azure SDK for Java. The post Replacing jackson-databind ...

1 day ago

March Patches for Azure DevOps Server

Today we are releasing patches that impact our self-hosted product, Azure DevOps Server. We strongly encourage and recommend that all customer...

1 day ago

Implementing Chat History for AI Applications Using Azure Cosmos DB Go SDK

This blog post covers how to build a chat history implementation using Azure Cosmos DB for NoSQL Go SDK and langchaingo. If you are new to the...

2 days ago
Stay up to date with latest Microsoft Dynamics 365 and Power Platform news!
* Yes, I agree to the privacy policy