H2O Isolation Forest Predictor

This node applies an Isolation Forest model to an input dataset in order to predict anomalies or outliers. The output of the node will consist of the input and, depending on the settings, one or two appended columns. One is the prediction which contains normalized anomaly score. The higher the score, the more likely it is an anomaly. The other (optionally) appended column contains the mean length of the predicted decision tree paths of each observation. The shorter, the more likely it is an anomaly.

Important note: All columns which have been used for training the model must be present in the incoming H2O frame as well.

Options

Settings

Prediction column name
Change the name of the prediction column.
Append column containing mean length
Select to append an extra column that contains the mean length of the predicted decision tree paths for each observation.
Mean length column name
Change the name of the created column that contains the mean length.

Input Ports

Icon
H2O Isolation Forest model, e.g. an Isolation Forest model.
Icon
H2O frame with data that is predicted

Output Ports

Icon
H2O Frame with the predictions.

Views

This node has no views

Workflows

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.