Palantir enables the creation of models that wrap weights produced outside of the platform. These weights can include open-source model weights, models trained in a local development environment, models trained in the Code Workspaces application, and model weights from legacy systems.
Once a Palantir model has been created, Palantir provides the following:
To create a model from model files, you will need the following:
First, upload your model files to an unstructured dataset in the Palantir platform. Create a new dataset by selecting +New > Dataset in a Project.
Then, select Import new data and choose the files from your computer to upload to the dataset.
If required, you can upload multiple files to the same dataset. The dataset will be unstructured, meaning it will not have a tabular schema.
Create a new code repository that will manage the logic for reading the model files from your unstructured dataset. The logic will wrap those files in a model adapter and publish them as a model. In the Code Repositories application, choose to initialize a Model Integration repository with the Model Training language template.
View the full documentation on the Model Training template and the model adapter API for reference.
To publish model files in your unstructured dataset as a Palantir model, you must author a transform that completes the following:
You can place the logic for loading and publishing a model within the model_training folder in the repository.
For additional information, we recommend reviewing the following documentation:
Once you have defined your model training logic, select Build to execute the logic to read the model files and publish a model.
```python
from transforms.api import transform, Input
from palantir_models.transforms import ModelOutput, copy_model_to_driver
import palantir_models as pm
import palantir_models_serializers as pms


@transform(
    model_files=Input("<Model Files Dataset>"),
    model_output=ModelOutput("<Your Model Path>"),
)
def compute(model_files, model_output):
    # Copy the uploaded model files from the unstructured dataset to the driver
    model = copy_model_to_driver(model_files.filesystem())
    # Wrap the loaded model in a model adapter and publish it as a Palantir model
    wrapped_model = ExampleModelAdapter(model)
    model_output.publish(
        model_adapter=wrapped_model
    )


class ExampleModelAdapter(pm.ModelAdapter):
    @pm.auto_serialize(
        model=pms.DillSerializer()
    )
    def __init__(self, model):
        self.model = model

    @classmethod
    def api(cls):
        pass  # Implement the API of this model

    def predict(self, df_in):
        pass  # Implement the inference logic
```
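To build intuition for what the load-and-publish step does, the following is a minimal, platform-independent sketch using only the Python standard library. It simulates the unstructured dataset as a local directory holding a serialized model file, then reads the file back and reconstructs the model object, which is roughly what `copy_model_to_driver` plus the adapter's serializer accomplish. All names here (`ThresholdModel`, `model.pkl`) are hypothetical illustrations, not part of the Palantir API.

```python
import os
import pickle
import tempfile


class ThresholdModel:
    """Toy stand-in for a model trained outside the platform."""

    def __init__(self, threshold):
        self.threshold = threshold

    def predict(self, values):
        # Classify each value as 1 if it meets the threshold, else 0
        return [1 if v >= self.threshold else 0 for v in values]


# Simulate the unstructured dataset: a directory containing a serialized model file.
model_dir = tempfile.mkdtemp()
model_path = os.path.join(model_dir, "model.pkl")
with open(model_path, "wb") as f:
    pickle.dump(ThresholdModel(0.5), f)

# The transform's job: read the file back and reconstruct the model object,
# analogous to copy_model_to_driver followed by deserialization in the adapter.
with open(model_path, "rb") as f:
    loaded = pickle.load(f)

print(loaded.predict([0.2, 0.7, 0.5]))  # [0, 1, 1]
```

Note that `pms.DillSerializer` in the transform above plays the role that `pickle` plays in this sketch: it defines how the wrapped model object is serialized and deserialized when the model is published and later loaded for inference.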
Once you have successfully published a model, you can consume the model for inference. Use the following documentation for guidance: