3. [Repositories] Creating a Project Output2. Updating Your Repository Folder Structure

2 - Updating your repository folder structure

This content is also available at learn.palantir.com ↗ and is presented here for accessibility purposes.

📖 Task Introduction

You currently have an early stage sequence of preparatory transformations: raw source → “local” raw copy → preprocessed. The output you create in this tutorial will serve as the input to the next stage in your pipeline. In this first task, you’ll update your repository structure to prepare for your cleaning logic.

🔨 Task Instructions

  1. Open your flight_alerts_logic repository in your Datasource Project: Flight Alerts project folder.
  2. Create a new branch from Master called yourName/feature/project_output_tutorial (e.g., jmeier/feature/project_output_tutorial).
  3. In the repository Files panel on the left, right click on the /datasets folder and create a new sub-folder named clean.
  4. Add a new file inside your /clean folder called flight_alerts_clean.py.

ℹ️ You may have noticed that by default, your repository assumes the output of your transform file should match the name you give it. In this example, creating a flight_alerts_clean.py file creates an changeable output path that initially ends with flight_alerts_clean.