Low Variance Filter

Filters out double-compatible columns, whose variance is below a user defined threshold. Columns with low variance are likely to distract certain learning algorithms (in particular those which are distance based) and are therefore better removed.

Note, the input table should not be normalized with a Gaussian normalization or any other normalization technique which changes the variances of the input.

Options

Variance upper bound
Choose a variance value here. The higher the value, the more columns are likely to get filtered out. Choose 0 to filter for columns, which only contain one constant value.
Columns to include
This list contains the names of the columns that are considered for filtering. Any other column will be left untouched (i.e. will also be present in the output table independent of their variance). Only double-compatible columns can be selected.

Input Ports

Icon
Numeric input data. (Non-numeric columns will be left untouched.)

Output Ports

Icon
Filtered data.

Views

This node has no views

Workflows

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.