The following are some frequently asked questions about Data Lineage.
For general information, view our Data Lineage documentation.
First, add your object to the Data Lineage graph by searching for it in the right panel (the tab with a magnifying glass icon). Select Object types to filter your search, then enter the name of the object for which you want to view the backing and writeback datasets.
Next, select the arrow on the left side of your object type node to show its ancestors. This should produce one ancestor node if your object type is read-only, and two ancestor nodes if your object type has writeback enabled. Make sure Resource overview is selected in the Node color options dropdown so that your writeback dataset is colored according to the legend in the top right. Backing schema dataset colors depend on the transform type used.
Your writeback and backing datasets for an object type will also have a small globe icon in the top right.
Selecting one of these columns will highlight the datasets in your selection that contain this column.
In the dropdown menu in the top right, choose Build status. You should now be able to see whether any dataset is currently running. Any such dataset has an open transaction.
Selecting a golden path will highlight the resources in this path on the graph. Hovering over a folder path will show you the full path.
You can select multiple properties in the Histogram of selection properties panel so that the graph highlights all resources that satisfy your selection.
To share an unsaved Data Lineage graph, select the arrow next to Save in the top right corner. There, you can find a quick share link.
Your dataset may not be up-to-date for a few reasons. Consider the following:
You can easily answer these questions in Data Lineage:
First, verify the status of each resource in your pipeline: open the dataset of interest in Data Lineage, then right-click on its node.
Then, select the Expand node... option. To view all ancestor nodes for that dataset, select the double left arrow above the Expand parents... option.
Next, select the Build status option in the Node color options dropdown menu in the top right to view the build status of every resource in your pipeline. This view of your pipeline will make it easier to diagnose stale datasets.
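The reasoning behind these steps can be sketched in a few lines of Python. This is only a conceptual illustration, not a Data Lineage API: the dataset names, the `PARENTS` mapping, the status strings, and both helper functions are hypothetical. It models the pipeline as a graph, expands all ancestors of one dataset, and keeps those whose build status is anything other than a successful build, which is roughly what you are scanning for visually when the graph is colored by build status.

```python
from collections import deque

# Hypothetical pipeline: each dataset maps to its direct parents (upstream inputs).
PARENTS = {
    "report": ["joined"],
    "joined": ["clean_orders", "clean_customers"],
    "clean_orders": ["raw_orders"],
    "clean_customers": ["raw_customers"],
    "raw_orders": [],
    "raw_customers": [],
}

# Hypothetical build statuses, standing in for the node colors on the graph.
BUILD_STATUS = {
    "report": "succeeded",
    "joined": "succeeded",
    "clean_orders": "failed",
    "clean_customers": "succeeded",
    "raw_orders": "succeeded",
    "raw_customers": "running",
}


def ancestors(dataset, parents):
    """Return every upstream dataset of `dataset`, nearest first (breadth-first)."""
    seen = []
    queue = deque(parents.get(dataset, []))
    while queue:
        node = queue.popleft()
        if node in seen:
            continue  # already visited via another path
        seen.append(node)
        queue.extend(parents.get(node, []))
    return seen


def suspect_ancestors(dataset, parents, status):
    """Return the ancestors whose last build did not succeed."""
    return [n for n in ancestors(dataset, parents) if status.get(n) != "succeeded"]


if __name__ == "__main__":
    print(suspect_ancestors("report", PARENTS, BUILD_STATUS))
```

In this toy pipeline, `report` looks healthy on its own, but two of its ancestors are flagged, so the data it was built from may be stale.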