Keeps duplicates

Supported in: Batch

Keep duplicate rows from the input.

Transform categories: Other

Declared arguments

  • Column subset - If any columns are specified only those will be used when determining uniqueness.
    Set<Column<AnyType>>
  • Dataset - Dataset to keep duplicate rows from.
    Table

Examples

Example 1: Base case

Argument values:

  • Column subset: {tail_number}
  • Dataset: ri.foundry.main.dataset.aggregate

Input:

tail_numberairlinemilesfactor
XB-123foundry air1242
MT-222new airline11235
XB-123foundry airline3355
MT-222new air5654
KK-452new air2221
XB-123foundry airline11343

Output:

tail_numberairlinemilesfactor
XB-123foundry air1242
MT-222new airline11235
XB-123foundry airline3355
MT-222new air5654
XB-123foundry airline11343

Example 2: Base case

Description: No subset looks for exact duplicates. Argument values:

  • Column subset: {}
  • Dataset: ri.foundry.main.dataset.aggregate

Input:

tail_numberairlinemilesfactor
XB-123foundry air1242
XB-123foundry air1242
XB-123foundry air1242
MT-222new airline11236
MT-222new airline11235

Output:

tail_numberairlinemilesfactor
XB-123foundry air1242
XB-123foundry air1242
XB-123foundry air1242

Example 3: Null case

Argument values:

  • Column subset: {tail_number}
  • Dataset: ri.foundry.main.dataset.aggregate

Input:

tail_numberairlinemilesfactor
nullfoundry air1242
nullnew airline11235
nullfoundry airline3355
MT-222new air5654
KK-452new air2221
XB-123foundry airline11343

Output:

tail_numberairlinemilesfactor
nullfoundry air1242
nullnew airline11235
nullfoundry airline3355