Preparation

Preparation has been superseded by Pipeline Builder and is no longer the recommended tool for cleaning and preparing data. Pipeline Builder lets you clean and prepare data for pipelines and also offers Marketplace support.

Preparation is an interactive tool for cleaning and preparing data. Cleaning refers to fixing data quality issues, and preparing refers to manipulating data to make it usable for a specific analytic task.

The dataset shown below comes from The Meteoritical Society via the NASA Data Portal.

Sample of Preparation cleaning workflow

Terminology

The following terms are useful to know before using the Preparation tool:

  • Preparation: A cleaning/preparation session

  • Clean: Fix data quality issues in a dataset

  • Prepare: Adapt a dataset for a specific use

  • Change: An individual clean/prepare step

  • Changelog: All the changes made within a preparation
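
To make the relationship between these terms concrete, the sketch below models a changelog as an ordered list of changes applied to a dataset. It is only a conceptual illustration in plain Python: Preparation is an interactive tool, and the names used here (Change, drop_missing, to_float, run_changelog) and the sample rows are hypothetical, not part of Preparation or any Foundry API.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, List

Row = Dict[str, Any]


@dataclass(frozen=True)
class Change:
    """One clean/prepare step: a description plus a function over rows."""
    description: str
    apply: Callable[[List[Row]], List[Row]]


def drop_missing(column: str) -> Change:
    # Clean: remove rows whose value in `column` is missing or empty.
    return Change(
        f"Drop rows with missing {column}",
        lambda rows: [r for r in rows if r.get(column) not in (None, "")],
    )


def to_float(column: str) -> Change:
    # Prepare: convert a text column to a number for a later analysis.
    return Change(
        f"Convert {column} to float",
        lambda rows: [{**r, column: float(r[column])} for r in rows],
    )


def run_changelog(rows: List[Row], changelog: List[Change]) -> List[Row]:
    # Apply every change in order; the input rows are left untouched.
    for change in changelog:
        rows = change.apply(rows)
    return rows


# Made-up sample rows (hypothetical, not taken from the meteorite dataset).
sample = [
    {"name": "A", "mass_g": "21.0"},
    {"name": "B", "mass_g": ""},
    {"name": "C", "mass_g": "720"},
]

# The changelog: all the changes made within one preparation, in order.
changelog = [drop_missing("mass_g"), to_float("mass_g")]
result = run_changelog(sample, changelog)

for change in changelog:
    print(change.description)
print(result)
```

In this model the order of the changelog matters: converting to a number before dropping the empty value would fail on row "B", which is why the cleaning step comes first.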