There are two ways to derive a set of output datasets from a set of input datasets with transforms:
A hybrid approach is also possible, where you have multi-output transforms, each having the same inputs and different sets of outputs. These options are explored in the following section.
A single output transform has X1, X2, ..., Xn
inputs and produces one output Y
. To obtain multiple outputs Y1, Y2, ..., Yn
, write multiple transforms taking these X
inputs, each writing to a different output. Each output has its own transform.
The following is a list that describes the advantages of having multiple single-output transforms.
Conversely, having having single-output transforms may also bring with it the following disadvantages.
Overall, this option comes with the most flexibility, but is less adapted to duplicate operations and might be more computationally expensive.
A multi-output transform has X1, X2, ... Xn
inputs and produces Y1, Y2, ... Yn
outputs.
We have a single transform for all outputs.
Consider the advantages of using multi-output transforms below:
Consider the disadvantages of using multi-output transforms below:
Multi-output transforms are less flexible, but they are well suited for repeated logic in outputs.
Things to consider when you are using multi-output transforms:
Single-output transforms are very flexible and well-suited to cases where the logic between outputs varies. Multi-output transforms are less flexible, but they can be more cost-effective under the right conditions. Generally, opt for multi-output transforms if you meet the following criteria:
If these conditions are met, you should opt for multi-output transforms. Otherwise, decide case by case while keeping in mind that multiple single-output transforms are the fallback option.