You can use the Data Lineage graph to see which datasets in your pipeline are out of date, and then use the Builds helper to start builds directly from Data Lineage.
Builds triggered from Data Lineage always apply to the branches (including fallback branches) configured in the graph.
The following are a few common build workflows:
This strategy builds the selected datasets and all ancestor datasets, to ensure that the selected datasets become completely up to date.
By default, this builds only ancestors that are out of date, but you can choose to force a re-build of up-to-date datasets. Forcing a re-build can be expensive in terms of build time and resources.
Clicking Next will not trigger any builds yet. You will simply see a preview of the datasets to be built.
If you decide you do not want to build all out-of-date ancestors, you must click Cancel on the current build preview, then change the nodes you have selected. You cannot change your selection from the build preview screen.
This strategy lets you bind your builds to a subset of your pipeline. A common use case for this strategy can occur when new raw data regularly lands in your pipeline and there is a particular dataset that you want to update to reflect the new data, but you don’t want to build all out-of-date ancestors. You can then use Data Lineage to determine which other datasets need to be built to bring your dataset of interest more up to date.
Clicking Next will not trigger any builds yet. You will simply see a preview of the datasets to be built based on the nodes you have selected. You can now see exactly what needs to be built to update your dataset of interest. You may not want to build all datasets – maybe there is a very large derived dataset that should only build once a day – so click Add all to graph at the bottom of the list.
This strategy allows you to pick individual datasets that you want to build. If there are dependencies between the datasets, builds would be executed in the right order to assure descendants are built after their ancestors were built.
If you want to change the datasets you are building, you must click Cancel on the current build preview, change the nodes you have selected, then enter a new preview. You cannot change your build selection from the build preview screen.
After examining the final list of datasets to be built, click Run build to trigger the builds.