DB Auto-Binner

This node allows to group numeric data in intervals - called bins. Please use the "Database Numeric-Binner" node if you want to define custom bins.

Options

Auto Binner Settings

Column Selection:
Columns in the include list are processed separately. The columns in the exclude list are omitted by the node.
Please note, that if the underlying database does not support "CASE" statements, only one column will be selectable. You can enable "CASE" support in the Advanced tab of the DB Connector node
Number of bins
The number of equidistant bins to create.
Bin Naming:
Use Numbered for bins labeled by an integer with prefix "Bin", Borders for labels using '"(a,b]"' interval notation or Midpoints for labels that show the midpoint of the interval.
Force integer bounds
Forces the bounds of the interval to be integers. The decimal bounds will be converted so that the lower bound of the first interval will be the floor of the lowest value and the upper bound of the last interval will be the ceiling of the highest value. The edges that separate the intervals will be the ceiling of the decimal edges. Duplicates of edges will be removed.

Examples:
[0.1,0.9], (0.9,1.8] -> [0,1], (1,2]
[3.9,4.1], (4.1,4.9], (4.9,5.1] -> [3,5], (5,6]
Replace target column(s):
If set the columns in the include list are replaced by the binned columns otherwise columns named with suffix '[binned]' are appended.

Number Format Settings

Advanced formatting
If enabled the format of the doubles in the labels can be configured by the options in this tab.
Output format
Specify the output format. The number 0.00000035239 will be displayed as 3.52E-7 with Standard String , 0.000000352 with Plain String (no exponent) and 352E-9 with Engineering String .
Precision
The scale of the double values to round to. If the scale is reduced the specified rounding mode is applied.
Precision mode
The type of precision to which the values are rounded. Decimal places, the default option rounds to the specified decimal places, whereas significant figures rounds to significant figures or numbers.
Rounding mode
The rounding mode which is applied when double values are rounded. The rounding mode specifies the rounding behavior. Seven different rounding modes are available:
  • UP: Rounding mode to round away from zero.
  • DOWN: Rounding mode to round towards zero.
  • CEILING: Rounding mode to round towards positive infinity.
  • FLOOR: Rounding mode to round towards negative infinity.
  • HALF_UP: Rounding mode to round towards "nearest neighbor" unless both neighbors are equidistant, in which case round up.
  • HALF_DOWN: Rounding mode to round towards "nearest neighbor" unless both neighbors are equidistant, in which case round down.
  • HALF_EVEN: Rounding mode to round towards the "nearest neighbor" unless both neighbors are equidistant, in which case, round towards the even neighbor.
For a detailed description of each rounding mode please see the Java documentation .

Input Ports

Icon
Data to define bins

Output Ports

Icon
Data with bins defined
Icon
PMML document containing binning parameters.

Popular Predecessors

Popular Successors

Views

This node has no views

Workflows

  • No workflows found

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.