1. Data Pipeline Foundations10. Transforming

10 - Transforming

This content is also available at learn.palantir.com ↗ and is presented here for accessibility purposes.

Foundry has two applications for production data transformation: Pipeline Builder and Code Repositories. This learning path provides an opportunity to practice using each.

Pipeline Builder is the primary application for building production pipelines in Foundry. Pipeline Builder's low/no-code orientation gives more Foundry users access to the pipeline development and maintenance process. You should begin all new data pipelines with Pipeline Builder and use Code Repositories when you need specialized code-based logic not available in Pipeline Builder, such as using custom libraries, writing complex user-defined functions, or making API calls.

These applications can be used together; you can use datasets output by Pipeline Builder as inputs to transformations in a code repository and vice versa.

In this learning path, you’ll also encounter the Job Tracker application, which enables data engineers to monitor the status of dataset builds, including Spark details.