API Documentation

Purpose of `moco`

moco takes machine learning models (or any "learned" function) and makes them cheaper to run. It consists of a suite of algorithms that optimize models for computational efficiency with no accuracy loss.

moco analyzes two kinds of data to find early-exit strategies to exploit:

Input data. An example early-exit rule for a fraud detection model (when the features are interpretable) might be: "it's fraud if the user has logged in 4 times in the last 10 minutes."
Intermediate representations (often referred to as embeddings or activations).
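
As a rough illustration of an input-level early-exit rule, the sketch below wraps a model so that the fraud rule quoted above is checked before the expensive call. The feature name `logins_last_10_min`, the reading of the rule as "at least 4 logins", and the `full_model` callable are all hypothetical; this is not output produced by moco.

```python
from typing import Callable, Dict


def make_early_exit_predictor(
    full_model: Callable[[Dict[str, float]], bool],
    login_threshold: int = 4,  # assumed reading of "logged in 4 times"
) -> Callable[[Dict[str, float]], bool]:
    """Wrap full_model with a cheap rule that is checked first."""

    def predict(features: Dict[str, float]) -> bool:
        # Early exit: the rule alone decides, so the full model is skipped.
        if features["logins_last_10_min"] >= login_threshold:
            return True
        # Otherwise fall through to the full (expensive) model.
        return full_model(features)

    return predict
```

The saving comes from the fraction of inputs that the rule resolves on its own; every other input still pays the full model's cost plus one comparison.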

moco analyzes the internal parameters (weights and biases) to find equivalent lossless representations of models.

Example: pruning.
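
To make the idea of lossless pruning concrete, here is a minimal NumPy sketch for a one-hidden-layer ReLU network. It removes only hidden units that provably cannot affect the output, based on the weights alone. The function names and the network layout are assumptions for illustration; this is not moco's algorithm.

```python
import numpy as np


def forward(x, W1, b1, W2, b2):
    """One-hidden-layer ReLU network: relu(x @ W1 + b1) @ W2 + b2."""
    hidden = np.maximum(x @ W1 + b1, 0.0)
    return hidden @ W2 + b2


def prune_dead_units(W1, b1, W2, b2):
    """Drop hidden units that can never change the output.

    A unit is removable if its outgoing weights are all zero, or if its
    incoming weights are all zero and its bias is <= 0 (ReLU output is 0).
    """
    no_output = np.all(W2 == 0.0, axis=1)
    always_off = np.all(W1 == 0.0, axis=0) & (b1 <= 0.0)
    keep = ~(no_output | always_off)
    return W1[:, keep], b1[keep], W2[keep, :], b2
```

Because the removed units contribute exactly zero for every possible input, the pruned network computes the same function as the original, up to floating-point summation order.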

moco also analyzes how the parameters interact with the data and activations to find equivalent lossless representations of models.
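
A small sketch of what a data-dependent analysis can look like, again assuming a one-hidden-layer ReLU network in NumPy: hidden units whose pre-activation is never positive on the supplied data output zero for every sample, so they can be dropped without changing the outputs on that data. The helper names are hypothetical and the technique is a simple stand-in, not a description of moco's internals.

```python
import numpy as np


def inactive_on_data(X, W1, b1):
    """Boolean mask of hidden ReLU units that never fire on any row of X."""
    pre_activation = X @ W1 + b1
    return np.all(pre_activation <= 0.0, axis=0)


def drop_inactive_units(X, W1, b1, W2):
    """Remove units whose ReLU output is zero for every sample in X."""
    keep = ~inactive_on_data(X, W1, b1)
    return W1[:, keep], b1[keep], W2[keep, :]
```

Unlike a purely weight-based check, the equivalence here is guaranteed only on the rows of `X`; an input outside that data could activate a dropped unit.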

Form Factor

moco is a pip-installable Python package that offers:

A command-line interface
A software library with proprietary algorithms accessible via API
A client-side helper library that aggregates outputs from the API into a machine-runnable format

API Methods

Pruning

prune_neurons(model: MLPClassifier, data: np.ndarray) -> MLPClassifier
condense_neurons(model: MLPClassifier, data: np.ndarray) -> Tuple[MLPClassifier, LogisticRegression]

Notes:

model is uploaded as a .joblib file.
model.activation must be "relu".
condense_neurons returns two models whose summed output matches the original model.
prune_neurons returns one model whose output matches the original model.
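
The notes above say only that the two models returned by condense_neurons sum to the original output; how moco achieves this is not described. One way such a decomposition can arise, sketched here in NumPy under the assumption of a one-hidden-layer ReLU network, is that units which are active on every sample behave linearly on that data and can be folded into a single linear model. All names below are hypothetical.

```python
import numpy as np


def split_linear_and_relu(X, W1, b1, W2, b2):
    """Split a ReLU net into a linear part and a smaller ReLU part.

    Units with pre-activation > 0 on every row of X act as the identity
    there, so their contribution is linear in the input.
    """
    always_on = np.all(X @ W1 + b1 > 0.0, axis=0)
    # Linear part: fold the always-on units into one weight matrix and bias.
    lin_W = W1[:, always_on] @ W2[always_on, :]
    lin_b = b1[always_on] @ W2[always_on, :]
    # ReLU part: keep the remaining units and the original output bias.
    rest = ~always_on
    mlp = (W1[:, rest], b1[rest], W2[rest, :], b2)
    return (lin_W, lin_b), mlp


def summed_output(X, linear, mlp):
    """Sum of the two parts' raw (pre-softmax) outputs."""
    lin_W, lin_b = linear
    W1, b1, W2, b2 = mlp
    return X @ lin_W + lin_b + np.maximum(X @ W1 + b1, 0.0) @ W2 + b2
```

On the rows of `X`, `summed_output` reproduces the original network's raw outputs while the ReLU part has fewer hidden units to evaluate.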

Early-Exiting

Note that early-exiting requires an analysis step so that the exit point is chosen deliberately. Training a probe classifier on an arbitrary layer has a low probability of success.


        class RoutedModel:
            base: torch.nn.Module
            conditional: torch.nn.Module
            router: torch.nn.Module
            yes_path: torch.nn.Module
            no_path: torch.nn.Module

compute_rule(model: torch.nn.Sequential, layer_name: str, data: torch.utils.data.Dataset) -> RoutedModel

Notes:

In many cases, yes_path is a simple constant function.
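
The source does not state how the five fields of RoutedModel are combined at inference time. The sketch below is one guess at the control flow, written with plain Python callables standing in for `torch.nn.Module` instances (which are also callable). The field names come from the class above; the roles assigned to `conditional` and `router`, and the `RoutedSketch` name itself, are assumptions.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass
class RoutedSketch:
    """Plain-callable stand-in for RoutedModel (field names from the source)."""

    base: Callable[[Any], Any]
    conditional: Callable[[Any], Any]
    router: Callable[[Any], bool]
    yes_path: Callable[[Any], Any]
    no_path: Callable[[Any], Any]

    def __call__(self, x: Any) -> Any:
        hidden = self.base(x)              # shared early layers
        signal = self.conditional(hidden)  # ASSUMED: evaluates the exit rule
        if self.router(signal):            # ASSUMED: yes/no routing decision
            return self.yes_path(hidden)   # cheap branch, often a constant
        return self.no_path(hidden)        # ASSUMED: rest of the original model
```

When `yes_path` is a constant function, every input routed to it skips the remaining layers entirely, which is where the computational saving would come from.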

Utility

get_layers(model: torch.nn.Sequential) -> List[str]
profile_layers(model: torch.nn.Sequential, data: torch.utils.data.Dataset) -> List[Tuple[str, float]]
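
For intuition about what these utilities report, here is a minimal sketch built on `named_children()`, a method that `torch.nn.Module` provides. It is not moco's implementation: the helper names are hypothetical, the sketch times a single batch rather than a whole Dataset, and it assumes the float in each tuple is wall-clock seconds, which the source does not specify.

```python
import time
from typing import Any, List, Tuple


def layer_names(model: Any) -> List[str]:
    """Names of the model's direct children, in order for a Sequential."""
    return [name for name, _ in model.named_children()]


def time_layers(model: Any, batch: Any) -> List[Tuple[str, float]]:
    """Wall-clock seconds spent in each child while pushing one batch through."""
    timings = []
    activation = batch
    for name, layer in model.named_children():
        start = time.perf_counter()
        activation = layer(activation)  # feed each layer the previous output
        timings.append((name, time.perf_counter() - start))
    return timings
```

With a real `torch.nn.Sequential`, the names are the string indices ("0", "1", ...) unless the model was built from an ordered dictionary of named layers. Timings from a single pass are noisy, so averaging over several batches gives a steadier picture.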

Compatibility