Easily deploy a pgvector-enabled PostgreSQL server to Azure
The pgvector extension was first released in 2021 but has shot up in popularity in the last year, as developers discover what is possible with vector embeddings, vector similarity, and vector search. Once we start storing embedding vectors in database rows, we can make queries like "which movies are more similar to each other, based on their synopsis?" and "which retail item's descriptions most closely match this user's query?"
What is pgvector?
The pgvector extension allows PostgreSQL users to store columns of a vector type, create an index (HNSW or IVF) to efficiently index the vector fields, and query using vector distance operators (cosine, Euclidean, or inner product). Or, to put that in SQL form:
Note: While the extension is typically referred to as "pgvector", the actual extension name is "vector", so that is what's used in the CREATE EXTENSION statement above.
Using pgvector on Azure PaaS
Azure now has multiple PaaS offerings for PostgreSQL that support the pgvector extension: PostgreSQL Flexible Server and Azure Cosmos DB PostgreSQL server.
If you already have an existing Azure PostgreSQL server, you can enable the extension manually in the Portal, as shown below and described in the linked tutorials.
A PostgreSQL flexible server with pgvector extension enabled.
An easy template for pgvector deployment
To make it even easier to get started with pgvector on PostgreSQL Flexible Server, we've created a template project that contains infrastructure-as-code (Bicep files) and support for the Azure Developer CLI (azd). Clone or download the project here:
https://github.com/Azure-Samples/azure-postgres-pgvector-python
With a few commands, you'll have a pgvector-enabled PostgreSQL server provisioned in your Azure account. We've also added keyless authentication to the template, so you can authenticate with your Azure credential instead of a secret.
The template project includes multiple Python scripts showing you how to connect to your PostgreSQL server and use the pgvector extension in the most common SQL packages: psycopg2, asyncpg, SQLAlchemy, and SQLModel.
For example, here's a selection of the code from a SQLAlchemy example for storing movie titles and their embeddings:
See the full code in the repository.
There are so many ways that you can use the pgvector extension once you've gotten started with it, both as a tool in building generative AI applications (especially RAG apps), but also in any situation where similarity is a heuristic, like recommendations, fraud detection, and more. Start bringing vectors into your apps today and let us know what you build!
Published on:
Learn moreRelated posts
Announcing Azure DevOps Server General Availability
We’re thrilled to announce that Azure DevOps Server is now generally available (GA)! This release marks the transition from the Release Candid...
Blue-green deployment in Azure Container Apps using Azure Developer CLI
Learn how to implement blue-green deployment in Azure Container Apps using Azure Developer CLI (azd) revision-based deployment strategy. The p...
Zonal resiliency in Azure
Microsoft Purview: Azure AI Foundry integration with Microsoft Purview for AI
Purview enablement in AI Foundry, allows Foundry admins to activate Microsoft Purview on their subscription. Once enabled, AI interaction data...
Long-term data retention up to 10 years: Announcing Private Preview of Azure Backup for Azure Cosmos DB
Azure Backup for Azure Cosmos DB is a new option that lets you securely protect and recover your Azure Cosmos DB data for compliance, audit, a...
Assessing Your Azure Data Factory for Migration to Fabric Data Factory
As organizations modernize their data integration workflows, moving from Azure Data Factory (ADF) to Fabric Data Factory is an important miles...
Video: Copilot Studio – Connect Azure SQL As Knowledge
SQL is where the worlds data is stored. And in this video I’m going to ... The post Video: Copilot Studio – Connect Azure SQL As Knowled...
Azure Backup Threat Detection
One Azure, Many Logins: How Users Access Microsoft’s Cloud Safely
Users can access Microsoft Azure through several flexible and secure methods, depending on their role, device, and workload needs. The most co...