The process of monitoring a pipeline is usually best managed by implementing on-call rotations. This means that one team member is actively monitoring the pipeline at a time ("on-call"), and responding to pipeline issues (usually in the form of failing health checks) is her most important priority for the duration of her on-call rotation.
The following steps are recommended for setting up an effective pipeline monitoring team:
documentation
folder of the Project where the key outputs of the pipeline live.An alerting mechanism allows you to respond reactively to health checks failing in your pipeline. This alleviates the need to periodically check a Data Lineage graph, dashboard, or report to see what the status of your pipeline is. Choosing the appropriate alerting mechanism depends on the scale of alerts and how tight your SLAs are (as this dictates how critical response time is).
The available options for automated alerting include:
Regardless of which option you implement, it is useful to implement filters so that you don't miss the alerts among other Foundry platform notifications.