0 ×

Term Document Entropy

KNIME Textprocessing Plug-in version 4.3.0.v202011212014 by KNIME AG, Zurich, Switzerland

This node computes the informational entropy of each term in each document. The nodes requires a bag of words table as input and appends an additional column to the output table, containing the entropy values. If a term occurs once in every document, its entropy for each document is 0. Any other combination of frequencies determines an entropy weight between 0 and 1. Please note, that the computational complexity of of the entropy calculation is greater than the number of terms times the number of documents. For big bag of words input tables the computation can be quite time consuming.

Options

Document selection

Document Column
Specifies the document column to use for frequency counting.

Input Ports

Icon
The input table which contains terms and documents.

Output Ports

Icon
The output table with terms, documents and a corresponding entropy value.

Best Friends (Incoming)

Best Friends (Outgoing)

Workflows

Installation

To use this node in KNIME, install KNIME Textprocessing from the following update site:

KNIME 4.3

A zipped version of the software site can be downloaded here.

You don't know what to do with this link? Read our NodePit Product and Node Installation Guide that explains you in detail how to install nodes to your KNIME Analytics Platform.

Wait a sec! You want to explore and install nodes even faster? We highly recommend our NodePit for KNIME extension for your KNIME Analytics Platform. Browse NodePit from within KNIME, install nodes with just one click and share your workflows with NodePit Space.

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.