Statistics (Labs)

A statistics node that calculates statistics for the selected columns and displays them in a table view. The view can be accessed either via the "interactive view" action on the executed node or on the KNIME WebPortal.
In the node configuration, you can choose the columns you want to display. The configuration also offers a preview of the table, which helps you get the table view into the desired shape quickly.
Only numeric and nominal (text-based) columns are supported; columns of all other types are ignored. Note that for nominal columns, the calculated numeric statistics refer to the string length. All statistics calculations ignore missing and NaN values. The first and last elements of a nominal column are determined by lexicographic order.
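
The handling of nominal columns and missing values described above can be illustrated with a minimal Python sketch. It is not the node's implementation: the helper names `clean` and `summarize` are hypothetical, missing values are assumed to be represented as `None`, and Python's code-point string comparison is assumed to stand in for the node's lexicographic order.

```python
import math

def clean(values):
    """Drop missing (None) and NaN entries, which the statistics ignore."""
    return [v for v in values
            if v is not None and not (isinstance(v, float) and math.isnan(v))]

def summarize(values):
    """Hypothetical helper: minimum, maximum and mean of one column."""
    present = clean(values)
    if not present:
        return None
    nominal = isinstance(present[0], str)
    # Numeric statistics on nominal columns refer to the string length.
    numbers = [len(v) for v in present] if nominal else present
    return {
        # For strings, min()/max() compare lexicographically (by code point).
        "minimum": min(present),
        "maximum": max(present),
        "mean": sum(numbers) / len(numbers),
    }
```

For example, `summarize(["pear", None, "fig", "banana"])` takes the minimum and maximum from the strings themselves, while the mean is computed from the lengths 4, 3 and 6.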

Note, this node is currently under development. Future versions of the node might have improved or changed functionality.

Options

Selected Columns
Selects the columns to calculate statistics from.
Displayed Statistics
Select the statistics that should be displayed in the table:
Note: Missing values and (in case of numeric columns) NaN values are ignored.
  • Name: When selected, a column with the dimension name is displayed.
  • Type: When selected, a column with the assigned data type of this column is displayed.
  • Number unique values: When selected, a column shows the number of unique values this column has.
  • Minimum: When selected, a column "Minimum" listing the minimum for each selected column is displayed. Note that for nominal columns the minimum is taken with respect to the lexicographic order.
  • Maximum: When selected, a column "Maximum" listing the maximum for each selected column is displayed. Note that for nominal columns the maximum is taken with respect to the lexicographic order.
  • Common values: When selected, a column "10 most common values" is displayed. For each column the 10 most common values are displayed together with the absolute and proportional frequency of occurrence (e.g., 'A (120, 24%), B (50, 10%), ...').
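
The format of the "10 most common values" entry can be sketched in Python as follows. This is an illustration only: the function name `most_common_values` is hypothetical, and it assumes that the percentage is taken relative to the number of non-missing values and rounded to whole percent, which the description does not specify.

```python
from collections import Counter

def most_common_values(values, limit=10):
    """Hypothetical helper: format the most frequent values of a column."""
    # Ignore missing values (None) and NaN (the only value not equal to itself).
    present = [v for v in values if v is not None and v == v]
    total = len(present)
    parts = []
    # most_common() sorts by descending count; ties keep first-seen order.
    for value, count in Counter(present).most_common(limit):
        share = round(100 * count / total)  # assumed: share of non-missing values
        parts.append(f"{value} ({count}, {share}%)")
    return ", ".join(parts)
```

Called on a column holding three times "A", once "B" and one missing value, the sketch produces `A (3, 75%), B (1, 25%)`.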
Note: For the following properties, the string length is used in case of nominal columns.
  • Quantiles: One can choose between several p-quantiles with values p = 1%, 5%, 10%, 25%, 50%, 75%, 90%, 95% and 99%. When selected, a column with the respective dimension quantile is displayed.
  • Mean: When selected, a column with the dimension mean is displayed.
  • Sum: When selected, a column with the dimension sum is displayed.
  • Mean absolute deviation: When selected, a column with the dimension mean absolute deviation is displayed.
  • Standard deviation: When selected, a column with the dimension standard deviation is displayed.
  • Variance: When selected, a column with the dimension variance is displayed.
  • Skewness: When selected, a column with the dimension skewness is displayed.
  • Kurtosis: When selected, a column with the dimension kurtosis is displayed.
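
As a rough guide to what these statistics mean, the following Python sketch computes them for a single numeric column (for a nominal column, the string lengths would be passed instead). The exact definitions used by the node are not stated in this description, so the sketch makes assumptions: quantiles use linear interpolation between closest ranks, variance, skewness and kurtosis use population moments (division by n), and kurtosis is not reduced by 3. The names `quantile` and `describe` are hypothetical.

```python
import math

def quantile(sorted_values, p):
    """p-quantile (0 <= p <= 1), linear interpolation between closest ranks."""
    position = p * (len(sorted_values) - 1)
    lower = math.floor(position)
    upper = math.ceil(position)
    fraction = position - lower
    return sorted_values[lower] + fraction * (sorted_values[upper] - sorted_values[lower])

def describe(values, probabilities=(0.01, 0.05, 0.10, 0.25, 0.50, 0.75, 0.90, 0.95, 0.99)):
    """Hypothetical helper: summary statistics of one numeric column."""
    # Missing (None) and NaN values are ignored.
    data = sorted(v for v in values if v is not None and not math.isnan(v))
    n = len(data)
    if n == 0:
        raise ValueError("no non-missing values")
    mean = sum(data) / n
    # Central moments in population form (assumed; the node may differ).
    m2 = sum((x - mean) ** 2 for x in data) / n
    m3 = sum((x - mean) ** 3 for x in data) / n
    m4 = sum((x - mean) ** 4 for x in data) / n
    return {
        "quantiles": {p: quantile(data, p) for p in probabilities},
        "mean": mean,
        "sum": sum(data),
        "mean_absolute_deviation": sum(abs(x - mean) for x in data) / n,
        "standard_deviation": math.sqrt(m2),
        "variance": m2,
        # Undefined for constant columns, reported as NaN in this sketch.
        "skewness": m3 / m2 ** 1.5 if m2 > 0 else float("nan"),
        "kurtosis": m4 / m2 ** 2 if m2 > 0 else float("nan"),
    }
```

For the values 1 to 5, this sketch yields a mean of 3, a sum of 15, a mean absolute deviation of 1.2 and a variance of 2. A sample-corrected implementation (division by n - 1) would report a variance of 2.5 for the same data, which is why the chosen convention matters when comparing results.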

Input Ports

Data table with data to display.

Output Ports

This node has no output ports

Views

Statistics (Labs)
A table view of statistics from the data.
