Loading...

Create ADF Events trigger that runs an ADF pipeline in response to Azure Storage events.

Create ADF Events trigger that runs an ADF pipeline in response to Azure Storage events.

Storage Event Trigger in Azure Data Factory is the building block to build an event-driven ETL/ELT architecture (EDA). Data Factory's native integration with Azure Event Grid let you trigger processing pipeline based upon certain events. Currently, Storage Event Triggers support events with Azure Data Lake Storage Gen2 and General Purpose version 2 storage accounts, including Blob Created and Blob Deleted.

 

Event-driven architecture (EDA) is a common data integration pattern that involves production, detection, consumption, and reaction to events. Data integration scenarios often require customers to trigger pipelines based on events happening in storage account, such as the arrival or deletion of a file in Azure Blob Storage account. Data Factory and Synapse pipelines natively integrate with Azure Event Grid, which lets you trigger pipelines on such events.

 

This blog demonstrates how we can use ADF triggers for running the ADF pipeline in events of Azure Storage events.

 

Prerequisites:

  • An ADLS Gen2 storage account or GPv2 Blob Storage Account
    Create a storage account - Azure Storage | Microsoft Docs
  • The integration described in this article depends on Azure Event Grid. Make sure that your subscription is registered with the Event Grid resource provider. For more info, see Resource providers and types. You must be able to do the Microsoft.EventGrid/eventSubscriptions/* action. This action is part of the EventGrid EventSubscription Contributor built-in role.

    To do so, the Resource Provider 'Microsoft.EventGrid' needs to be registered in the Subscription as per the below screenshot: 

RidhimaSinha_0-1652416307749.png

 

  • If the blob storage account resides behind a private endpoint and blocks public network access, you need to configure network rules to allow communications from blob storage to Azure Event Grid. You can either grant storage access to trusted Azure services, such as Event Grid, following Storage documentation, or configure private endpoints for Event Grid that map to VNet address space, following Event Grid documentation
     
  • The Storage Event Trigger currently supports only Azure Data Lake Storage Gen2 and General-purpose version 2 storage accounts.

 

  • To create a new or modify an existing Storage Event Trigger, the Azure account used to log into the service and publish the storage event trigger must have appropriate role based access control (Azure RBAC) permission on the storage account

  • Service Principal for the Azure Data Factory does not need special permission to either the Storage account or Event Grid

 

Demo:                                    

 

Step 1:

Create ADF resource on Azure Portal.  If you are new to ADF, please refer this link on how to create one:
Create an Azure data factory using the Azure Data Factory UI - Azure Data Factory | Microsoft Docs

 

Step 2:

Once Data Factory is created, navigate to Azure Data Factory Studio present in the Overview section:

 

RidhimaSinha_1-1652416307755.png

 

Step 3:

As we land on the ADF portal, Create Linked service for storage account in ADF Portal as per the below screenshots:

RidhimaSinha_2-1652416307759.png

 

Once you click on ‘+New’, we need to first select the data source. If you are using GPv2 Blob Storage account use ‘Azure Blob Storage’ and if you are working with ADLS Gen2 account use ‘Azure Data Lake Storage Gen2’. I’ve used Gen2 in this demo.

RidhimaSinha_3-1652416307761.png

After selecting the Data Store fill in required details as below:

 

RidhimaSinha_4-1652416307764.png

 

Once the Test Connection is successful, click on ‘Create. This will create the storage account linked service.

 

Step 4:

Creating Input and Output Datasets

In this demo, we will create a simple ADF pipeline that will copy an ‘emp.txt’ file from one folder ‘input’ to another folder ‘output’ within a container. Hence, we need input and output datasets in ADF that maps to the blobs in input and output folder. So let’s create InputDataset and OutputDataset:

 

Go to ‘Author’ on ADF portal and click on ‘New Dataset’ as per below screenshot:

 

RidhimaSinha_5-1652416307766.png

 

 

RidhimaSinha_6-1652416307769.png

 

 

RidhimaSinha_7-1652416307771.png

 

 

RidhimaSinha_8-1652416307772.png

 

Then click on ‘Ok’.

Similarly, you can create OutputDataset as below:

RidhimaSinha_9-1652416307773.png 

 

Step 5:

Create the ADF pipeline to copy data from ‘input’ to ‘output’ folder as per the below screenshots:

 

RidhimaSinha_10-1652416307776.png

 

Give a pipeline name and drag ‘Copy Data’ activity to the designer surface. Name the activity:

 

RidhimaSinha_11-1652416307781.png

 

Select Source and Sink as below:

RidhimaSinha_12-1652416307781.png

 

 

RidhimaSinha_13-1652416307782.png

 

Now ‘Validate’ the pipeline and ‘Debug’ to check whether it works as expected.

 

RidhimaSinha_14-1652416307782.png

Step 6:

Once the pipeline is validated, let’s Create BlobCreated event Trigger as per below screenshot:

 

RidhimaSinha_15-1652416307784.png

 

Choose Trigger--> New:

 

RidhimaSinha_16-1652416307788.png

 

After clicking on ‘Continue’, you will get ‘Data Preview’.  This shows the blobs that matches the event trigger filters thus you can verify whether the filter is correct or not. Click ‘Continue’ and you will see ‘Parameters’ section. This is helpful when you want to pass any parameters to the pipeline. Skip this as we are not using parameters in this demo and click ‘Ok’.

 

Now we have all the components in place and next step would be to ‘Publish’ all the changes.

RidhimaSinha_17-1652416307791.png

 

Once publish is completed, let’s test the trigger.

 

Upload file ‘emp.txt’ to input folder and this should fire the BlobCreated event thus firing ADF trigger.

 

RidhimaSinha_18-1652416307793.png

 

File copied to output folder:

 

RidhimaSinha_19-1652416307795.png

 

ADF Trigger run:

 

RidhimaSinha_20-1652416307797.png

 

Pipeline run:

 

RidhimaSinha_21-1652416307800.png

 

As we see from the result screenshots above, the BlobCreated trigger works as expected and runs the attached ADF pipeline.

 

Similarly, BlobDeleted event can be created.

 

Reference link:

Create event-based triggers - Azure Data Factory & Azure Synapse | Microsoft Docs

 

Hope this helps!

 

Published on:

Learn more
Azure PaaS Blog articles
Azure PaaS Blog articles

Azure PaaS Blog articles

Share post:

Related posts

Introducing Azure HorizonDB - PostgreSQL

Run enterprise Postgres workloads on Azure HorizonDB with around 3x the throughput of self-managed deployments — zone-resilient by default, no...

13 hours ago

Azure DevOps and GitHub: Journeying into the AI Era

AI is changing how software gets planned, built, and reviewed. As teams adopt agentic development, the platform underneath those workflows mat...

1 day ago

Introducing azure-functions-skills: An AI-Era Workspace for Azure Functions (Preview)

azure-functions-skills gives GitHub Copilot CLI, Claude Code, Codex CLI, and VS Code the skills, MCP configuration, hooks, and instructions ne...

1 day ago

Announcing the Public Preview of Integrated Embeddings in Azure Cosmos DB: Build AI Apps With Embeddings That Stay in Sync

AI applications built on Azure Cosmos DB depend on embeddings for grounded results. Keeping them in sync with your data is the hard part: it m...

1 day ago

Introducing OmniVec: An Open-Source Embedding Platform for AI Apps on Azure

Today we are open-sourcing OmniVec, a platform for building and operating the embedding pipelines that keep the vector representation of your ...

1 day ago

Azure Cosmos DB All Versions and Deletes Change Feed Mode is Now Generally Available

Modern applications don’t just write data and move on. They react to it. A new order triggers an inventory update. A profile change sync...

1 day ago

Change Partition Keys in Azure Cosmos DB is Now Generally Available

We’re excited to announce the general availability of Change Partition Key in Azure Cosmos DB for NoSQL, now with online copy support. Y...

1 day ago

Announcing the General Availability of Per Partition Automatic Failover for Azure Cosmos DB NoSQL

Today, we are excited to announce the General Availability of Per Partition Automatic Failover (PPAF) for Azure Cosmos DB NoSQL API. PPAF is a...

1 day ago

Public Preview: AI-powered Azure Cosmos DB Migration Assistant for RDBMS to NoSQL

Today, we are excited to announce the public preview of the Azure Cosmos DB Migration Assistant for RDBMS to NoSQL, now available in the Azure...

1 day ago

Azure Cosmos DB MCP Toolkit Is Now Generally Available — Bringing Your Database to AI Agents at Scale

Since we introduced the Azure Cosmos DB MCP Toolkit at Ignite 2025 in preview, the response has been clear: developers want a straightforward ...

1 day ago
Stay up to date with latest Microsoft Dynamics 365 and Power Platform news!
* Yes, I agree to the privacy policy