Transform data

You can start transforming and structuring your data in Pipeline Builder after adding datasets to your workspace.

Select a dataset

To apply a transform to a dataset, select its node in your workspace and click Transform.

Screenshot of an operation selection menu highlighting the transform selection

Search for a transform

On the transform page, search for a transform type by name or browse the list of available transforms. If you are using a structured (tabular) dataset, the search field shows a comprehensive list of table transforms.

Screenshot of transforms search page

For semi-structured datasets, such as JSON files, the search field also includes file transforms that let you parse your dataset into table format.

Screenshot of file transforms search page
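
To make "parsing into table format" concrete, here is a minimal conceptual sketch in plain Python. It is not Pipeline Builder's implementation or API. The sample records, the dotted column naming, and the helper names `flatten` and `to_table` are all assumptions made for illustration.

```python
import json

# Hypothetical semi-structured input: nested objects and a missing field.
raw = """
[
  {"id": 1, "name": "North Clinic", "address": {"city": "Austin", "state": "TX"}},
  {"id": 2, "name": "South Clinic", "address": {"city": "Denver"}}
]
"""


def flatten(record, prefix=""):
    """Flatten nested objects into a single-level dict with dotted keys."""
    flat = {}
    for key, value in record.items():
        name = f"{prefix}{key}"
        if isinstance(value, dict):
            flat.update(flatten(value, prefix=f"{name}."))
        else:
            flat[name] = value
    return flat


def to_table(records):
    """Return (columns, rows); a field missing from a record becomes None."""
    flat_records = [flatten(r) for r in records]
    columns = []
    for rec in flat_records:
        for key in rec:
            if key not in columns:
                columns.append(key)  # keep first-seen column order
    rows = [[rec.get(col) for col in columns] for rec in flat_records]
    return columns, rows


columns, rows = to_table(json.loads(raw))
print(columns)
for row in rows:
    print(row)
```

The sketch shows the general idea behind such a parse: every record contributes to a shared set of columns, and records that lack a field get an empty value in that column.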

Learn more about datasets.

Configure a transform

Fill in the transform configuration board with the required information, such as columns, expressions, or values. In the example below, we chose the Rename columns transform, selected the columns to rename, and entered a new name for each column.

Screenshot of transforms configuration page
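
For readers who think in code, the effect of a column rename can be sketched in plain Python. This is an illustrative model only, not how Pipeline Builder performs the transform. The function `rename_columns`, the column names, and the sample rows are hypothetical.

```python
def rename_columns(columns, rows, renames):
    """Return a new (columns, rows) pair with the selected columns renamed.

    `renames` maps an existing column name to its new name. Row data is
    left unchanged because renaming only affects the header.
    """
    missing = [old for old in renames if old not in columns]
    if missing:
        raise KeyError(f"Columns not found: {missing}")
    new_columns = [renames.get(col, col) for col in columns]
    if len(set(new_columns)) != len(new_columns):
        raise ValueError("Renaming would produce duplicate column names")
    return new_columns, rows


# Hypothetical facility table with terse column names.
columns = ["fac_id", "fac_nm", "city"]
rows = [[1, "North Clinic", "Austin"], [2, "South Clinic", "Denver"]]

new_columns, new_rows = rename_columns(
    columns, rows, {"fac_id": "facility_id", "fac_nm": "facility_name"}
)
print(new_columns)
```

The two checks mirror what a configuration form typically needs from you: each selected column must exist, and the new names must not collide with other columns.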

Apply a transform

After completing the transform configuration, click Apply to add the transform to your workflow. The transform node appears in your graph, connected to the original dataset. In this example, we named the new transform Clean Facility Data, and it is a direct output of the original Facility dataset.

Screenshot of graph page with new transform

You can rename or edit the transform by clicking the transform node and selecting Edit.

To change connections on the graph, drag the white output circles on the nodes.