5B. [Repositories] Publishing and Using Shared Libraries in Code Repositories10. Are Your Datasets Up To Date

10 - Are your datasets up-to-date?

This content is also available at learn.palantir.com ↗ and is presented here for accessibility purposes.

📖 Task Introduction

You may have a general sense for how current your builds are, but as your pipeline increases in complexity and the dependency graph expands, it can be challenging to intuit whether part of your pipeline is up-to-date vis-à-vis the rest. In this task, we’ll take a break from our building narrative and head over to the Data Lineage graph to evaluate the recency of the datasets built in our flight_alerts_logic repository.

🔨 Task Instructions

  1. Having merged your feature branch into Master in your flight_alerts_logic repository, ctrl+click the Explore lineage button in the top right of your repository.

  2. In Data Lineage, change the node coloring options in the top right to out-of-date.

  3. At the bottom of the out-of-date node color legend, trying ticking and un-ticking the Data out-of date and Logic out-of-date options, noting the coloring differences. Navigate to the Data Lineage node coloring documentation and find the out-of-date item in the table for a description of the possible values.

    This is a valuable node coloring option for determining whether your pipeline is universally up-to-date and where to look to correct build issues. In our case, nothing new has been built on the Master branch, so the Data in the nodes all appears current. However, there is new logic on the Master branch, so ticking the Logic box reveals datasets out of sync with the new logic.

  4. Select all of the nodes on the graph with ctrl+a.

  5. Click the “hammer” icon (“Manage builds”) in the collapsed side menu on the right.

  6. Choose Selected dataset(s) only and then click the blue Next (View preview) button.

  7. Click the blue Run build button at the bottom right of the screen. Once the process completes, the datasets (Data and Logic) will appear updated on your graph.