5A. [Repositories] Working with Raw Files in Code Repositories6. Exercise Summary

6 - Exercise Summary

This content is also available at learn.palantir.com ↗ and is presented here for accessibility purposes.

✅ What you built

  1. A Datasource Project: Passengers project.
  2. A passengers_logic Code Repository.
  3. Python transforms that create “local” project copies of the raw, unparsed JSON and CSV files.
  4. “Raw” dataset files built on the Master branch.

✅ What you learned

  • Foundry datasets are most often composed of Parquet ↗ files, which are optimized for Spark.
  • Datasets can contain raw CSV and JSON files without a schema.
  • The Foundry Explorer helper is a file navigation interface that lets you quickly browse all files and folders and preview selected datasets.