Repartition data

Supported in: Batch

Forces a shuffle of the data, redistributing rows according to the optionally provided partitioning columns and the target number of partitions. If these are not provided, the partitioning is determined automatically.

Transform categories: Other

Declared arguments

  • Dataset - Dataset to repartition.
  • optional Number of partitions - Number of partitions to shuffle the data into.
  • optional Partitioning columns - List of columns to partition the data by.
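
Example

The following is a minimal, illustrative sketch of how the two optional arguments could interact. It is plain Python, not the transform's actual implementation. The function name `repartition`, the fallback `default_partitions=4`, the CRC32 hash, and the round-robin behaviour when no columns are given are all assumptions made for illustration. The one property it is meant to show is that rows sharing the same values in the partitioning columns land in the same partition.

```python
import zlib
from typing import Any, Dict, List, Optional, Sequence

Row = Dict[str, Any]


def repartition(
    rows: Sequence[Row],
    num_partitions: Optional[int] = None,
    partition_columns: Optional[Sequence[str]] = None,
    default_partitions: int = 4,  # assumed fallback, stands in for "determined automatically"
) -> List[List[Row]]:
    """Redistribute rows into partitions (hypothetical sketch)."""
    n = num_partitions if num_partitions is not None else default_partitions
    if n < 1:
        raise ValueError("number of partitions must be at least 1")

    partitions: List[List[Row]] = [[] for _ in range(n)]
    for index, row in enumerate(rows):
        if partition_columns:
            # Hash the values of the partitioning columns so that equal keys
            # always map to the same partition.
            key = repr(tuple(row[c] for c in partition_columns)).encode("utf-8")
            target = zlib.crc32(key) % n
        else:
            # No columns given: spread rows evenly in round-robin order.
            target = index % n
        partitions[target].append(row)
    return partitions


# Usage: six rows, three partitions, partitioned by "country".
rows = [
    {"country": c, "amount": i}
    for i, c in enumerate(["US", "DE", "US", "FR", "DE", "US"])
]
by_country = repartition(rows, num_partitions=3, partition_columns=["country"])
evenly_spread = repartition(rows, num_partitions=2)
```

In this sketch, `by_country` keeps all rows for a given country together, while `evenly_spread` ignores row content and only balances partition sizes. Which of the two is preferable depends on whether downstream steps group or join on the partitioning columns.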