
Corpus Creator

Palladian for KNIME version by palladian.ws

This node creates a document corpus which contains, for each given token, the number of documents in the corpus in which that token occurs. It is intended to be connected to an “N-Gram Extractor” node, so a “token” can be a word, a word-n-gram, or a token-n-gram.


Term input
Input column which contains the tokens as a collection cell.

Input Ports

Input table with a collection column which contains each document’s terms.

Output Ports

A table with a row for each token and its corresponding document count (i.e., the number of input documents which contain the given term).
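
The document count described for the output port can be illustrated with a minimal Python sketch. This is not the node's actual implementation (Palladian is a Java library); the function name `document_counts` and the sample documents are hypothetical and chosen only for illustration. The sketch assumes that a token appearing several times in one document still counts only once for that document, which is how the output description reads.

```python
from collections import Counter
from typing import Dict, Iterable, List


def document_counts(documents: Iterable[List[str]]) -> Dict[str, int]:
    """Count, for each token, how many documents contain it.

    Hypothetical helper for illustration; each inner list plays the role
    of one row's collection cell in the input table.
    """
    counts: Counter = Counter()
    for tokens in documents:
        # De-duplicate within a document so repeated tokens count once.
        counts.update(set(tokens))
    return dict(counts)


# Made-up sample input: three documents, already tokenized.
docs = [
    ["the", "quick", "fox", "the"],
    ["the", "lazy", "dog"],
    ["quick", "dog"],
]

result = document_counts(docs)
for token, count in sorted(result.items()):
    print(token, count)
```

Each printed line corresponds to one row of the output table: a token and the number of input documents that contain it.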



To use this node in KNIME, install Palladian for KNIME from the following update site:


