MLflow runtime for MLServer

This package provides a MLServer runtime compatible with MLflow models.

Usage

You can install the runtime, alongside mlserver, as:

pip install mlserver mlserver-mlflow

Content Types

The MLflow inference runtime introduces a new dict content type, which decodes an incoming V2 request as a dictionary of tensors. This is useful for certain MLflow-serialised models, which will expect that the model inputs are serialised in this format.

Note

The dict content type can be stacked with other content types, like np. This allows the user to use a different set of content types to decode each of the dict entries.