This content is also available at learn.palantir.com ↗ and is presented here for accessibility purposes.
A clean dataset is one that is ready for use in Foundry workflows. For our case, this means joining the text-based mapped_values in the status and priority mapping tables to the flight alerts dataset. That way, the flights dataset will contain values your colleagues can more readily use in operational settings. Join transforms in Pipeline Builder work similarly to how SQL joins work. If you are unfamiliar with joins, brush up on some basics again using a tutorial from W3 Schools ↗. A quick overview of how joins work in Pipeline Builder can be found in this part of the documentation.
Select the flight_alerts_preprocessed
node and add a join transform.
Note: Order matters. E.g. If you are executing a left join on the flight_alerts_preprocessed dataset, this must also appear on the left-hand side of the input tables field. If you started the wrong way around, don't worry, you can always click "swap" to configure which table is the left table and which one the right table.
Use the following tips to join with the priority_mapping_preprocessed
node.
priority
column in the flight alerts dataset matches the value
column in the priority mapping dataset.priority
column from the left side of the join and the value
(or priority_value
if you added a prefix) from the right.On the join page, click into the join name field in the upper left area of the screen and rename it from the default (Join
) to Flight Alert Priority Join
.
Select Apply and confirm the preview at the bottom looks accurate.
If you are stuck, scroll down to see an example solution.
Save your pipeline.