0 ×


KNIME Textprocessing Plug-in version 4.3.0.v202011212014 by KNIME AG, Zurich, Switzerland

Computes three variants of the inverse document frequency (idf) for each term according to the given set of documents and adds a column containing the idf value. Smooth, normalized, and probabilistic idf. The default variant is smooth idf specified as follows: idf(t) = log(1 + (f(D) / f(d, t))).
The normalized idf is defined by: idf(t) = log(f(D) / f(d,t)).
The probabilistic idf is defined by: idf(t) = log((f(D) - f(d,t)) / f(d,t)), where f(D) is the number of all documents and f(d,t) is the number of documents containing term t.


Frequency options

IDF variant
Choose which variant of the inverse document frequency to compute. Default is smooth idf.

Document selection

Document Column
Specifies the document column to use for frequency counting.

Input Ports

The input table which contains terms and documents.

Output Ports

The output table which contains terms documents and a corresponding frequency value.

Best Friends (Incoming)

Best Friends (Outgoing)



To use this node in KNIME, install KNIME Textprocessing from the following update site:


A zipped version of the software site can be downloaded here.

You don't know what to do with this link? Read our NodePit Product and Node Installation Guide that explains you in detail how to install nodes to your KNIME Analytics Platform.

Wait a sec! You want to explore and install nodes even faster? We highly recommend our NodePit for KNIME extension for your KNIME Analytics Platform. Browse NodePit from within KNIME, install nodes with just one click and share your workflows with NodePit Space.


You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.