Evidently and Airflow
Run model evaluation or data drift analysis as a part of Airflow DAG.
Last updated
Run model evaluation or data drift analysis as a part of Airflow DAG.
Last updated
Apache Airflow is an open-source workflow management tool.
You can use this integration to generate JSON profiles or HTML reports and as a step in the Airflow DAG.
An integration example is available as a Docker container:
Follow the readme to install and modify the example.
It contains two specific DAGs that match common batch monitoring needs.
You can generate an Evidently report (e.g. a data drift report) every time the new data arrives. You can then store it in your file system.
Here is a DAG example:
You might not always need to generate visual reports every time.
For example, you can run checks on the model performance and only generate the reports if a certain condition is satisfied. For example, if you detect drift or performance drop. Otherwise, you can simply log the results.
Here is a DAG example:
It works as the following:
Run a data drift check by generating an Evidently JSON profile
If the drift is not detected, log the JSON output
If the drift is detected, generate and store a visual HTML report