Corpus Creator

Go to Product

This nodes creates a document corpus which contains the occurrence count of each given token in regard to every document in the corpus. This node is intended to be connected to an “N-Gram Extractor” node (this means, a “token” can be a word, word-n-gram, or token-n-gram).

Options

Term input: Input column which contains the tokens as a collection cell.

Input Ports

: Input table with a collection column which contains each document’s terms.

Output Ports

: A table with a row for each token and its corresponding document count (i.e. the number of input documents which contain the given term)

Popular Predecessors

Cell Splitter8 %
NGram Creator8 %
Document Grabber7 %
N-Gram Extractor6 %
Table Creator6 %
Show all 42 recommendations

Popular Successors

Views

This node has no views

Workflows

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.

Go to Product

Installation

To use this node in KNIME, install the extension Palladian for KNIME from the below update site following our NodePit Product and Node Installation Guide:

v5.12

A zipped version of the software site can be downloaded here.

Plugin provider: palladian.ws

Plugin version: 3.4.0.202601041906

On NodePit since: 2026-07-07

Last update: 2026-07-28

Tags: Streamable

KNIME versions: Since v3.6

NodePit ExclusiveOnly available on NodePit

Deploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.

Try NodePit Runner!