Event-Driven Onboarding, Scanning and Classification of Data Sources and Setup of Lineage to Azure Purview.

marvinbuss/PurviewAutomation

Microsoft Purview Event-driven Automation


This solution will simplify data governance tasks in an organization that uses Microsoft Purview as its core data catalog.


In a decentralized data platform such as the Data Management & Analytics Scenario reference architecture, Data Product teams can create their data services in a self-service way. This makes it increasingly difficult for data governance personas to onboard data sources, scan the content, classify the data and govern data quality across the data estate. This solution aims to automate these tasks.

This solution automates the following tasks:

  1. Discovery of Data Sources within the Data Platform.
  2. Onboarding of Data Sources within the respective Microsoft Purview Collection.
  3. Setup of Managed Private Endpoints onto the managed virtual network inside Purview.
  4. Setup of Scanning and Triggers for Data Sources.
  5. Setup of Lineage for supported Data Sources (Synapse, Data Factory, etc.).

Below you will find some of the core capabilities:

Event-driven onboarding of Data Sources

The solution automatically onboards data sources to a Microsoft Purview collection when a new data source is created within an Azure subscription. The collection structure is created automatically within your Purview account.
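As an illustration of the onboarding step, the sketch below shows one way a newly created resource could be mapped to a place in a collection hierarchy. It is not the code used by this solution: the `ResourceRef` type, the `parse_resource_id` and `collection_path` helpers, and the "root > subscription > resource group" layout are all assumptions made for this example. See the Default Collection Structure page for the layout the solution actually creates.

```python
from typing import List, NamedTuple


class ResourceRef(NamedTuple):
    """Components of an Azure Resource Manager (ARM) resource ID."""
    subscription_id: str
    resource_group: str
    provider: str        # e.g. "Microsoft.Storage"
    resource_type: str   # e.g. "storageAccounts"
    name: str


def parse_resource_id(resource_id: str) -> ResourceRef:
    """Split the ARM ID of a top-level resource into its components."""
    parts = resource_id.strip("/").split("/")
    # Expected shape:
    # subscriptions/<sub>/resourceGroups/<rg>/providers/<namespace>/<type>/<name>
    if (
        len(parts) < 8
        or parts[0].lower() != "subscriptions"
        or parts[2].lower() != "resourcegroups"
        or parts[4].lower() != "providers"
    ):
        raise ValueError(f"not a resource-level ARM ID: {resource_id!r}")
    return ResourceRef(parts[1], parts[3], parts[5], parts[6], parts[7])


def collection_path(ref: ResourceRef, root: str = "root") -> List[str]:
    """Hypothetical mapping: root collection > subscription > resource group."""
    return [root, ref.subscription_id, ref.resource_group]
```

For example, a storage account with the ID `/subscriptions/<sub>/resourceGroups/rg-data/providers/Microsoft.Storage/storageAccounts/lake01` would land under `root > <sub> > rg-data` in this hypothetical layout.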

Event-driven removal of Data Sources

The solution can remove data sources from Purview when they are deleted in Azure. Scanned data assets are not removed when a data source is deleted from a Purview collection. This feature can be disabled.
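The routing between onboarding and removal, including the option to switch removal off, can be sketched as follows. This is a minimal illustration and not the solution's actual implementation. It assumes events arrive in the Azure Event Grid schema for subscription events, which this README does not state. The `plan_action` function and its `removal_enabled` flag are hypothetical names; the real toggle is described on the Application Settings page.

```python
from typing import Any, Dict

# Event types emitted by Azure Event Grid for subscription-level resource changes.
WRITE_SUCCESS = "Microsoft.Resources.ResourceWriteSuccess"
DELETE_SUCCESS = "Microsoft.Resources.ResourceDeleteSuccess"


def plan_action(event: Dict[str, Any], removal_enabled: bool = True) -> str:
    """Return "onboard", "remove" or "ignore" for a single event (hypothetical helper)."""
    event_type = event.get("eventType", "")
    if event_type == WRITE_SUCCESS:
        return "onboard"
    if event_type == DELETE_SUCCESS:
        # Removal is optional; with the flag off, delete events are ignored.
        return "remove" if removal_enabled else "ignore"
    # Any other event type (failed writes, actions, ...) is not handled here.
    return "ignore"
```

With `removal_enabled=False`, a delete event results in `"ignore"`, so the data source stays registered in Purview.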

Automated setup of Scans

The solution can automatically create and trigger scans for data services. This feature can be disabled.

Deployment

You have two options to deploy this solution to your Azure tenant:

  1. GitHub Actions
  2. Deploy to Azure button

For more details, please visit the deployment documentation page.

More Details

  1. Deployment
  2. Supported Services
  3. Access Requirements
  4. Default Collection Structure
  5. Application Settings
  6. Architecture
