Split transform

The Split transform is used to partition input data into two distinct outputs based on a specified condition. This transform evaluates each row of the input data against the defined condition and directs the rows to one of the two outputs accordingly.

An example Pipeline Builder graph with the Split transform

To use the Split transform, select any dataset node in your graph and select Split.

Screenshot of the Split transform in Pipeline Builder

Condition

The Split transform allows you to define a condition that determines how the input data is divided. This condition is a logical expression that evaluates to either True or False.

Outputs

The True output will contain rows for which the condition evaluates to true. These rows are directed to the first output.

The False output will contain rows for which the condition evaluates to false. These rows are directed to the second output.

To preview the outputs, you can use the dropdown in the top left of the bottom panel. Select True to see the True output or False to see the False output.

Screenshot of an example Split transform preview table

To use the outputs in downstream transforms, select the transform, and then select either the True or False output to use as your next input.

Screenshot of the Select input Split transform window

Example

Consider a dataset of customer orders. You want to separate orders into two categories: high-value and low-value orders, based on a threshold value.

In this case, the Condition will be order_value > 1000.

The True output will contain orders where order_value exceeds 1000, and the False output will contain orders where order_value does not exceed 1000.

Screenshot of an example Split transform configuration