In this video we will generate a report for the data quality tests we run against our data. In the previous two sessions, we covered Pytest basics and integrated Pytest into our Python ETL pipeline. This report will display the test name and the results. It can serve as a documentation and status of the tests. The Pytest generates the report as an html document. We can share these documents with stakeholders as well as compile them into a website that showcases the results of the data quality test. The website can be accessed by any user who is interested in the results of the data quality tests. This documentation of the data quality tests will help the consumers understand the data quality test in place and will notify them of any failures.
ETL Pipeline video: • How to build an ETL pipeline with Pyt...
Previous Pytest video: • How to test your Python ETL pipelines...
First video on ETL Testing : • How to test your Python ETL pipelines...
Link to GitHub repo (code): https://github.com/hnawaz007/pythonda...
#etltesting #dataquality #etl
Pytest Docs: https://docs.pytest.org/en/7.2.x/
Subscribe to our channel:
/ haqnawaz
---------------------------------------------
Follow me on social media!
Github: https://github.com/hnawaz007
Instagram: / bi_insights_inc
LinkedIn: / haq-nawaz
---------------------------------------------
Topics covered in this video:
0:00 - Introduction to ETL testing
0:35 - Data Quality Report
1:33 - Data Quality Script
2:18 - Generate Data Quality Report
2:40 - Review Data Quality Report
3:28 - Standalone Data Quality Report