The regression trainer trains a number of models, in parallel or sequentially, to determine the best-performing model. It may also take advantage of ensembling if it determines that a weighted ensemble of models performs better than any single model. This trainer is built on AutoGluon's TabularPredictor class.
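To illustrate why a weighted ensemble can outperform every individual model, the stdlib-only sketch below (illustrative, not AutoGluon's actual implementation) searches candidate weight vectors over the validation predictions of two hypothetical, already-trained models and keeps the blend with the lowest validation error:

```python
# Hypothetical validation predictions from two already-trained models,
# alongside the true targets. These numbers are invented for illustration.
preds_a = [1.0, 2.0, 3.0, 4.0]   # model A undershoots at the extremes
preds_b = [3.0, 2.0, 3.0, 6.0]   # model B overshoots at the extremes
truth   = [2.0, 2.0, 3.0, 5.0]

def mse(preds):
    """Mean squared error against the validation targets."""
    return sum((p - t) ** 2 for p, t in zip(preds, truth)) / len(truth)

best_w, best_err = None, float("inf")
for w in [i / 10 for i in range(11)]:  # candidate weights for model A
    blend = [w * a + (1 - w) * b for a, b in zip(preds_a, preds_b)]
    err = mse(blend)
    if err < best_err:
        best_w, best_err = w, err

print(best_w, round(best_err, 3))  # → 0.5 0.0
```

Here each model alone has an MSE of 0.5, but the equal-weight blend is exact, so the ensemble is selected over either single model.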
The regression trainer may also perform stacking or bagging to further improve performance. Stacking is an ensembling process in which the predictions produced by the set of models trained on the input data are used to train a further "layer" of models; this process may be repeated. Bagging is an internal optimization method in which each model architecture is trained on multiple random samples of the data and the outputs are combined in an ensemble. Stacking and bagging are controlled by the Training preset parameter.
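The two ideas can be sketched in a few lines of stdlib-only Python. This is a toy illustration of the concepts, not the trainer's actual code: bagging fits the same model type on bootstrap samples and averages the outputs, and stacking turns base-model predictions into features for a next layer (here a simple average stands in for that second-layer model):

```python
import random
import statistics

def fit_constant(sample):
    """Toy 'model': predicts the mean target of its training sample."""
    mean = statistics.mean(y for _, y in sample)
    return lambda x: mean

def fit_slope(sample):
    """Toy 'model': predicts y = w * x with w fit by least squares."""
    num = sum(x * y for x, y in sample)
    den = sum(x * x for x, _ in sample) or 1.0
    return lambda x, w=num / den: w * x

def bag(fit, data, n_bags=5, rng=random.Random(0)):
    """Bagging: fit one model per bootstrap sample, average their outputs."""
    models = [fit([rng.choice(data) for _ in data]) for _ in range(n_bags)]
    return lambda x: statistics.mean(m(x) for m in models)

data = [(float(x), 2.0 * x) for x in range(1, 11)]  # toy data, y = 2x

# Stacking: base-model predictions become the features for a next "layer".
base_models = [bag(fit_constant, data), bag(fit_slope, data)]
layer2_data = [(tuple(m(x) for m in base_models), y) for x, y in data]
# A second-layer model would now be trained on layer2_data; here we simply
# average the base predictions as a stand-in for that trained layer.
stacked = lambda x: statistics.mean(m(x) for m in base_models)
print(stacked(5.0))
```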
The structure of the model ensemble can be viewed on the output model's experiment page under Plots.
Internally, multiple models will be trained unless this is disabled by excluding specific models or through the hyperparameter field.
The following types of models are available for training:
While some of these models are strictly classification models, they can still be applied in regression when stacking is used.
best_quality
: Enables stacking and bagging. Use this preset when the best possible model is required, even at the cost of training and inference speed.

high_quality
: Enables stacking and bagging. Training and inference should be faster than with the best_quality preset, but the model may be slightly less accurate.

good_quality
: Enables stacking and bagging. This preset trains and predicts faster than the previous presets, with decent predictive accuracy.

medium_quality
: Disables stacking and bagging. This preset has the fastest training time but only moderate predictive accuracy; use it for prototyping.

The optional stacking configuration fields allow for deeper control over how the model ensemble is constructed:
- If enabled, a WeightedEnsemble model will be produced at each stacking layer. We recommend enabling this for improved model performance.
- If enabled, the WeightedEnsemble of the last stacking layer will be fit with models from all previous layers as base models. This option has no effect when using a preset that disables stacking. We recommend enabling this for improved model performance.
- If enabled, a WeightedEnsemble model will be produced on top of the weighted ensembles produced when fit last ensemble with all models is enabled. This option has no effect if fit last ensemble with all models is disabled, or when using a preset that disables stacking.

The optional hyperparameter field allows for deeper customization and control over each trained model, with one caveat: when this field is defined, any model not supplied in it will be ignored. This field overrides the default hyperparameters chosen by AutoGluon and can produce poor results. For this reason, we recommend avoiding this field unless you have consulted the AutoGluon documentation. In the majority of cases, the default hyperparameters provide strong enough results.
In general, the arguments passed here will be sent directly to the underlying model implementation.
Take the example below:
```json
{
  "GBM": [
    {"extra_trees": true},
    {}
  ],
  "NN_TORCH": {}
}
```
These hyperparameters enforce that the following models are trained:

- Two GBM models: one with extra_trees set to true, and one with default hyperparameters.
- One NN_TORCH model with default hyperparameters.

With the provided hyperparameters, these are the only models that will be trained before any ensembling or stacking operations are applied.
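The expansion rule can be sketched as follows. This is a stdlib-only illustration of how such a mapping unfolds into individual model configurations, not AutoGluon code: a list value yields one model per entry, while a single dict yields exactly one model.

```python
def expand_hyperparameters(hyperparameters):
    """Expand a hyperparameters mapping into (model_type, config) pairs.

    Illustrative only: a list value yields one model per entry; a single
    dict yields exactly one model with that configuration.
    """
    configs = []
    for model_type, value in hyperparameters.items():
        entries = value if isinstance(value, list) else [value]
        for config in entries:
            configs.append((model_type, config))
    return configs

hp = {"GBM": [{"extra_trees": True}, {}], "NN_TORCH": {}}
print(expand_hyperparameters(hp))
# → [('GBM', {'extra_trees': True}), ('GBM', {}), ('NN_TORCH', {})]
```

Three model configurations result before any ensembling: two GBM variants and one default NN_TORCH model.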
The regression trainer will output a Foundry model that contains the best model as determined by the validation steps. Details about the model can be accessed by navigating to the experiment, which will contain parameters, metrics, and plots that provide insight into the model's performance.