0 ×

Spark H2O MOJO Predictor (Word Embedding)

KNIME Extension for MOJO nodes on Spark version 4.3.0.v202012011123 by KNIME AG, Zurich, Switzerland

This node applies a word embedding MOJO to an input column.

Note: The input will not be stripped, so leading or trailing whitespaces should be removed with a String Manipulation node before prediction if desired.

Options

Word Embedding Settings

Input column
The input column of type String. The content of the String cells will be treated as one word to predict, no tokenization will be performed.
Change prediction column name
Change the name of the prediction column.
Fail if a word is not in the vocabulary
If checked, the node will fail if any of the words in the input table is missing in the vocabulary of the MOJO. Otherwise, a missing value will be the output.

Spark Settings

Upload MOJO dependency
If checked, the MOJO dependency (genmodel jar file) will be uploaded to the cluster. Otherwise depend on cluster side provided dependency.

Input Ports

Icon
The MOJO. Its model category must be word embedding.
Icon
Spark DataFrame/RDD for prediction. Missing values will be treated as NA .

Output Ports

Icon
Spark DataFrame/RDD containing the predicted vector.

Best Friends (Incoming)

Installation

To use this node in KNIME, install KNIME Extension for MOJO nodes on Spark from the following update site:

KNIME 4.3

A zipped version of the software site can be downloaded here.

You don't know what to do with this link? Read our NodePit Product and Node Installation Guide that explains you in detail how to install nodes to your KNIME Analytics Platform.

Wait a sec! You want to explore and install nodes even faster? We highly recommend our NodePit for KNIME extension for your KNIME Analytics Platform. Browse NodePit from within KNIME, install nodes with just one click and share your workflows with NodePit Space.

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.