Document Vector Hashing Applier

This node creates a document vector for each document representing it in the terms space. The values of the feature vectors will be boolean values or as values of either the relative frequency or the absolute frequency of the terms. This node needs a model input provided by the Document vector hashing node which contains the specifications for vector creation.

Options

Document column: The column containing the documents to use.
As collection cell: If checked all vector entries will be stored in a collection cell consisting of double cells. If not checked all double cells will be stored in corresponding columns. The advantage of the column representation is that most of the regular algorithms in KNIME can be applied. The disadvantage is (which is on the other hand the advantage of the collection representation) that processing of subsequent nodes will be slowed down, due to the many columns that will be created (dependent on the input data of course).

Input Ports

: The model containing the settings to create the document vector space.
: The input table containing the documents.

Output Ports

: An output table containing the input documents with the corresponding document vectors.

Views

This node has no views

Workflows

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.

Installation

To use this node in KNIME, install the extension KNIME Textprocessing from the below update site following our NodePit Product and Node Installation Guide:

v5.9

A zipped version of the software site can be downloaded here.

Plugin provider: KNIME AG, Zurich, Switzerland

Plugin version: 5.9.0.v202511181600

On NodePit since: 2025-12-11

Last update: 2026-01-19

Tags: Streamable

KNIME versions: Since v3.6

Deploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.

Try NodePit Runner!