Subgraphs in Pipeline Builder are in the beta phase of development and may not be available on your enrollment. Functionality may change during active development.
The subgraph feature in Pipeline Builder lets you package one or more transforms into a reusable block. You can then apply that block anywhere in a pipeline without rebuilding the logic each time. Subgraphs offer several benefits:
This guide covers how to:
You create and manage reusable subgraphs from the Reusables menu.
In Pipeline Builder, open the pipeline where you want to define the subgraph.
At the top of the graph, select Reusables, then select Subgraphs.

In the Subgraphs window, select Add custom subgraph.

A new, empty subgraph opens.
Select Add input in the center of the canvas.

An Input node appears. Select Edit beneath it.

In the right-side pane, configure the Required columns for this input. You can either add columns manually or copy a schema from an existing dataset:

If you select From dataset, choose the dataset whose schema you want to use, then confirm with Select. The input node now expects the columns from the chosen dataset.

With inputs defined, you can build out the subgraph just like any other pipeline:

When you are done, select Apply all subgraph changes in the top right. Give the subgraph a clear name. In this example, the subgraph is named Manually created subgraph. Select Close to exit the subgraph editor. Your subgraph is now available as a reusable block.

You can add a subgraph after any node whose output matches the subgraph's input schema. There are two entry points:


Select the pipeline node where the subgraph should start.
Select Apply subgraph to table, or select Subgraph from the node menu.
In the banner that appears, open the Select a subgraph dropdown and choose your subgraph.

Map the subgraph's Input to the selected node in the graph. Pipeline Builder maps this automatically if you start from that node. Select Start.

The subgraph configuration panel opens:
Review the Outputs tab to confirm the resulting schema, then select Apply and Close.

If the subgraph's input does not contain all the required input columns, schema errors appear in the Errors tab and on the pipeline graph.

You might already have a sequence of nodes that you want to reuse. You can convert those nodes directly into a subgraph, or replace them with a subgraph you have already created.

A new subgraph editor opens containing the selected nodes, along with automatically created Input and Output nodes with populated schemas.

When the subgraph is valid and no schema errors appear, select Apply all subgraph changes.
When your subgraph has multiple inputs, you map each input separately when you apply it.
Student to a rename transform and Class to a type-conversion transform. Select Start.
The subgraph configuration window opens, showing the mapped inputs and a preview of the outputs. Confirm the output looks correct, then select Apply and Close.

Back on the main canvas, the subgraph appears as a single node with its output schema, such as class_id and avg_score, visible in the preview panel.

You can then rewire connected nodes to use the subgraph output, and replace duplicated logic elsewhere with additional instances of the same subgraph.