9 - Passenger Object Type: Backing Datasets

This content is also available at learn.palantir.com ↗ and is presented here for accessibility purposes.

Remember that in a production environment, backing datasets are shaped and developed as part of a rigorously maintained data pipeline, never by a Contour analysis. In this lesson, we are merely borrowing Contour's ability to output a dataset in order to simulate such a pipeline.

🔨 Task Instructions

  1. Navigate back to your Pipeline Builder (“Training Pipeline Simulator”) used to output the backing dataset for your flight alert object type.
  2. After opening Pipeline Builder, follow these steps to create a new path that starts from the following dataset: ../ Foundry Training and Resources/Example Data/Aviation Ontology/passengers
    1. Click on Add data at the top of the data lineage.
    2. Then Add datasets from Foundry.
    3. Paste the path listed above in the Search all files field, select the + to the right of the passengers dataset from the left column, then click Add datasets.
  3. Click the + Add button near the top-right corner of the Pipeline outputs sidebar on the right, then click the + Add button next to Dataset. The right sidebar should now contain column names auto-populated based on the passengers dataset.
  4. Rename your output dataset to passengers_{yourname}_{date} using the field at the top of the right sidebar (above the columns and all other buttons, where the original name says New dataset {date})
  5. Repeat steps 2 through 4 to add /Foundry Training and Resources/Example Data/Aviation Ontology/Join Tables/passenger_flight_alerts and create a passengers_flight_alerts_{yourname}_{date} output dataset.
  6. To copy the data from the source dataset to the newly created output dataset, click the green "save" button at the top of the Pipeline Builder UI, and then click the blue "deploy" button close to it. When that button opens a pop-up, confirm the deployment by clicking the green "deploy pipeline" button.
  7. It may take up to few minutes for that new dataset to be ready - you can monitor the progress of the dataset by clicking the refresh wheel icon on the right side of the deploy button; you can click on it to view details or wait for it to become a green check mark that means the deployment was successful.