Document Vector Hashing Applier

This node creates a document vector for each document representing it in the terms space. The values of the feature vectors will be boolean values or as values of either the relative frequency or the absolute frequency of the terms. This node needs a model input provided by the Document vector hashing node which contains the specifications for vector creation.


Document column
The column containing the documents to use.
As collection cell
If checked all vector entries will be stored in a collection cell consisting of double cells. If not checked all double cells will be stored in corresponding columns. The advantage of the column representation is that most of the regular algorithms in KNIME can be applied. The disadvantage is (which is on the other hand the advantage of the collection representation) that processing of subsequent nodes will be slowed down, due to the many columns that will be created (dependent on the input data of course).

Input Ports

The model containing the settings to create the document vector space.
The input table containing the documents.

Output Ports

An output table containing the input documents with the corresponding document vectors.


This node has no views




You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.