This content is also available at learn.palantir.com ↗ and is presented here for accessibility purposes.
After deploying your pipeline, you receive feedback that the flight_alerts_clean
dataset could use some improvements:
rule_id
column is not useful in any context and should be removed.comment
and assignee
columns that can serve as editable properties on Ontology object types backed by this pipeline. These should have None
values by default.In this task, you’ll branch your clean pipeline, develop and test these changes, and then merge them back into the main branch. The instructions below will prompt you through branching in Pipeline Builder. If you would like to read up on branching in Pipeline Builder, refer to these resources: Pipeline Builder branching documentation: Create a branch → Propose a change → Approve a change
Open your clean pipeline and create a new branch called develop
by clicking the Main ▾ dropdown in the upper right of the screen.
Introduce a new transform step between your output clean dataset and the Flight Alert Status Join
transform.
Label your new transform path Additional Cleaning
and add transforms that:
rule_id
columncomment
with a default string value of None
assignee
with a default string value of None
Return to your graph, where your new transform should be automatically connected to your flight_alerts_clean
output. Note that Pipeline Builder has surfaced a schema mismatch; the output is looking for rule_id
and is not properly configured to accept the new constant columns.
In the Pipeline outputs panel on the right, click the Edit button and then on the blue Use updated schema button to re-baseline the schema to match your transform logic.
Deploy your pipeline.
Right click on your output dataset node and choose Open (click the ↗
button to the right to open the dataset in a new tab).
In the upper left area of the Dataset Preview app, you can see your dataset now exists on two branches: Master (corresponds with Main in your pipeline) and Develop, which you just built (the dataset might actually still be building).
Your changes look good. Return to your pipeline and click the Propose button in the upper right of the screen and follow the prompts to merge your Develop branch into Main. Before merging, be sure to select the option to Deploy branch "Main" after merging this proposal.