How to ensure quality and avoid inaccuracies in your data insights
In this comprehensive session, discover how to ensure accuracy and reliability in your data projects by implementing testing methodologies for notebooks, pipelines, and Power BI reports.
James Broome, Director of Engineering, explores real-world examples from dozens of endjin's data & analytics projects; teaching techniques like unit testing in Fabric Notebooks, snapshot testing for ETL processes in Synapse Pipelines, and user acceptance testing for Power BI models. Learn to avoid common pitfalls and enhance your data practices with effective testing strategies.
Chapters
- 00:00 Introduction and Context
- 00:16 The Importance of Accurate Data Reporting
- 01:53 Challenges in Data Testing
- 03:40 Common Excuses for Lack of Testing
- 05:29 Levels of Software Testing
- 08:23 Unit Testing with Fabric Notebooks
- 11:28 Refactoring for Testability
- 19:09 Integration Testing with Pipelines
- 22:45 Testing Pipeline Behaviour with Valid and Invalid Data
- 23:40 Running and Monitoring Pipeline Tests
- 25:21 Snapshot Testing for Data Validation
- 31:05 User Acceptance Testing in Power BI
- 33:51 Creating Executable Specifications with Gherkin
- 34:40 Building Dynamic DAX Queries for Testing
- 37:18 Testing Power BI Reports Locally and in the Cloud
- 43:17 Conclusion and Final Thoughts
Published on:
Learn moreRelated posts
Microsoft Fabric Services! Is It Worth The Hype?
In the realm of data services, Microsoft Fabric services has emerged as a formidable competitor, offering a suite of tools such as Data Factor...
Connect Power BI and Spark notebooks with Microsoft Fabric Semantic Link
The new Semantic Link feature in Microsoft Fabric is creating quite a buzz in the world of data analytics. With this feature, it is now possib...
Killing me softly – Has Microsoft Fabric just “overwritten” Synapse Analytics?
If you're a user of Synapse Analytics, you may be feeling a bit uncertain about the future with the emergence of Microsoft Fabric. In this art...
A Tour Around Microsoft Fabric
This video takes you on a comprehensive tour of Microsoft Fabric given by Ed Freeman. Ed is a Senior Data Engineer who spent six months in the...
Microsoft Fabric: Announced
Microsoft Fabric extends the promise of Azure Synapse integration to all analytics workloads from the data engineer to the business knowledge ...
Power BI, Synapse and Data Factory news from Microsoft Build 2023 (Day 2)
Microsoft Build 2023 has brought exciting developments for Power BI, Synapse, and Data Factory. This two-day digital event has unveiled a plet...
Unleash your Dynamics 365 Data with Azure Synapse Analytics and Power BI
If you're looking to maximize the value of your Dynamics 365 data, this video tutorial is a must-watch. Here, you'll learn how to leverage the...
Parameterize your Notebooks in Azure Synapse
In this video, Patrick walks you through the process of parameterizing your notebooks in Azure Synapse Analytics, in a simple, easy-to-underst...
Azure Data Factory/Synapse Pipeline Tip : Google Sheet Connector - An Intro
Get to know the Google Sheet Connector in this informative tutorial on Azure Data Factory/Synapse Pipeline Tips. While the description and con...
Azure Data Factory / Synapse Pipeline (ADF) Quick Tip: Lake Databases – An Overview
This video provides a quick tip for working with Azure Data Factory or Synapse Pipeline(AFD) and understanding lake databases. The speaker pro...