Corpus Creator

Go to Product

This nodes creates a document corpus which contains the occurrence count of each given token in regard to every document in the corpus. This node is intended to be connected to an “N-Gram Extractor” node (this means, a “token” can be a word, word-n-gram, or token-n-gram).

Options

Term input
Input column which contains the tokens as a collection cell.

Input Ports

Icon
Input table with a collection column which contains each document’s terms.

Output Ports

Icon
A table with a row for each token and its corresponding document count (i.e. the number of input documents which contain the given term)

Views

This node has no views

Workflows

Further Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.