Spark H2O MOJO Predictor (Word Embedding) (legacy)

This node applies a word embedding MOJO to an input column.

Note: The input will not be stripped, so leading or trailing whitespaces should be removed with a String Manipulation node before prediction if desired.

Options

Word Embedding Settings

Input column: The input column of type String. The content of the String cells will be treated as one word to predict, no tokenization will be performed.
Change prediction column name: Change the name of the prediction column.
Fail if a word is not in the vocabulary: If checked, the node will fail if any of the words in the input table is missing in the vocabulary of the MOJO. Otherwise, a missing value will be the output.

Spark Settings

Upload MOJO dependency: If checked, the MOJO dependency (genmodel jar file) will be uploaded to the cluster. Otherwise depend on cluster side provided dependency.

Input Ports

: The MOJO. Its model category must be word embedding.
: Spark DataFrame/RDD for prediction. Missing values will be treated as NA .

Output Ports

: Spark DataFrame/RDD containing the predicted vector.

Popular Predecessors

Popular Successors

No recommendations found

Views

This node has no views

Workflows

No workflows found

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.

Installation

To use this node in KNIME, install the extension KNIME Extension for MOJO nodes on Spark (Legacy) from the below update site following our NodePit Product and Node Installation Guide:

v5.6

A zipped version of the software site can be downloaded here.

Plugin provider: KNIME AG, Zurich, Switzerland

Plugin version: 5.6.0.v202507151412

On NodePit since: 2025-08-15

Last update: 2025-08-16

KNIME versions: Since v3.6

Deploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.

Try NodePit Runner!