3. [Builder] Creating a Project Output2. Create Your Cleaning Pipeline

2 - Create your cleaning pipeline

This content is also available at learn.palantir.com ↗ and is presented here for accessibility purposes.

📖 Task Introduction

You currently have an early stage sequence of preparatory transformations: raw source → “local” raw copy → preprocessed. The output you create in this tutorial will serve as the input to the next stage in your pipeline. In this first task, you’ll create the clean segment of your pipeline.

🔨 Task Instructions

  1. Proceed to the Datasource project folder that you created in the previous tutorial, e.g., .../Temporary Training Artifacts/${yourName}/Data Engineering Tutorials/Datasource Project: Flight Alerts/.
  2. Call your batch pipeline flight_alerts_datasource_clean.
  3. Add the three datasets from your /preprocessed folder.
  4. Optionally apply a color and label to the three preprocessed nodes.