Monitoring at scale reduces the time required to monitor Foundry resources by introducing enhanced capabilities. Monitoring views are a collection of monitoring rules and health checks.
Monitoring views offer expanded functionality beyond what is available in check groups and we recommend upgrading all existing check groups to monitoring views.
With monitoring views, you can monitor the following resource types:
Resource type | Supported scope |
---|---|
Agent | Single, Project |
Object type | Single |
Link type | Single |
Schedule | Single, Project |
Streaming datasets | Single, Folder, Project |
Live deployments | Project |
Time series syncs | Single |
Geotemporal observations | Single |
Automations | Single, Project |
Dataset | Single, Folder, Project |
If you need detailed content and schema validation, consider using health checks.
To create a new monitoring view, navigate to the Monitoring View tab in the top right corner of the Data Health application and select New monitoring view.
In your new monitoring view, you can Add monitoring rules on the Manage monitors tab. First, select the resource type you are looking to monitor. Depending on the resource type, you can either choose to monitor just that resource on a single scope, or you can monitor all the resources of that type across a single or multiple project scope.
You must have Viewer
permission on the resources to monitor them. To receive alerts triggered by monitoring rules, you must have Viewer
permission on the resources and the monitoring view.
Additionally, you can review the Monitoring rules resource reference.
Monitors are set on the metrics emitted by a resource. As you set up your monitors, we suggest certain configurations based on Foundry’s standards for health. However, you can change the values or choose to only monitor certain metrics. You can also determine the level of severity for the alert. Currently, there are three severity types: low, medium, and high.
You can edit your monitors by selecting from the list of monitors and choosing Edit on the side panel that appears.
Once you have configured your monitoring rules, you will need to set up how and where alerts are delivered when issues are detected.
To subscribe to alerts, navigate to the Manage subscriptions tab where all the subscribed users are listed. You can add users and user groups, and configure their alerts based on severity. When a monitor rule triggers an alert, the user subscribed to the monitoring view containing that alert will be notified via email and Foundry notifications. Note that you must have Viewer
permission on the resources and the monitoring view to be able to receive alerts.
You can send alerts to external systems such as PagerDuty or Slack with built-in integrations or by using a webhook to hit arbitrary REST endpoints. Learn more about sending alerts to external systems.
On the Troubleshoot alerts tab, you can review your alerts by alert name, resource, failure reason, and the time of reported alert. Additionally, you can view options, group by project, or filter by type (for checks only).
To upgrade an existing check group, open your check group in the Data Health application. In the top banner, select Upgrade to monitoring view.
You can create a new monitoring view or move all the checks to an existing monitoring view.