This content is also available at learn.palantir.com ↗ and is presented here for accessibility purposes.
📖 Task Introduction
You currently have an early stage sequence of preparatory transformations: raw source → “local” raw copy → preprocessed. The output you create in this tutorial will serve as the input to the next stage in your pipeline. In this first task, you’ll update your repository structure to prepare for your cleaning logic.
🔨 Task Instructions
Open your flight_alerts_logic repository in yourDatasource Project: Flight Alerts project folder.
Create a new branch from Master called yourName/feature/project_output_tutorial (e.g., jmeier/feature/project_output_tutorial).
In the repository Files panel on the left, right click on the /datasets folder and create a new sub-folder named clean.
Add a new file inside your /clean folder called flight_alerts_clean.py.
ℹ️ You may have noticed that by default, your repository assumes the output of your transform file should match the name you give it. In this example, creating a flight_alerts_clean.py file creates an changeable output path that initially ends with flight_alerts_clean.