DB Binner

Groups numeric columns into discrete intervals, known as bins. This node supports both automatically binning to equal-width intervals and a manual binning method. Each bin is assigned a label based on its position, boundary values, or midpoint. The resulting binned values are output as string-type columns, either replacing the original data or appended alongside it.

Options

Columns to bin
Select the numeric columns to apply binning to. Only the selected columns will be transformed.
Binning type
Select the method used to define bin intervals.
  • Equal width: Creates bins of equal size across the value range. Requires a specified number of bins.
  • Custom cutoffs: Manually define bin edges using a list of cutoff values. At least two cutoffs are required.
Number of bins
Specifies the number of bins to create. Used with Equal width and Equal frequency binning.
Custom cutoffs
Specifies the exact value at which the bin boundary should be placed.
  • Cutoff: Specifies the exact value at which the bin boundary should be placed.
  • Exact match: Defines how to assign values that match a cutoff exactly.
    • To lower bin: Assign to the bin below the cutoff.
    • To upper bin: Assign to the bin above the cutoff.
Enforce integer cutoffs
If enabled, bin boundaries will be rounded to the nearest integer. Not available when using custom cutoffs.
Bin names/values
Selects how bins are labeled in the output. Use a 'Number Format Manager' node to customize number formatting.
  • Numbered (e.g., Bin 1, Bin 2): Bins are labeled by index.
  • Borders (e.g., [0.0, 1.0)): Bins are labeled using interval borders.
  • Midpoints: Bins are labeled using the midpoint of each interval.
Prefix
Text to prepend to bin labels when using numbered naming (e.g., Bin 1, Bin 2).
Lower outlier name/value
Label assigned to values below the first.
Upper outlier name/value
Label assigned to values above the last bin.
Output columns
Whether to replace the original columns with the binned columns, or to append the binned columns with new names created by appending the specified suffix to the original column names.
  • Replace: Replaces the original column with the binned result.
  • Append with suffix: Adds a new column with the binned values.
Suffix
Text appended to column names when new columns are created using the Append option.
Number formatting
Determine which number format should be applied if bin naming is either 'Borders' or 'Midpoints'. Choose Advanced to customize the number format using additional settings.
Minimum number of decimals
The formatted number will show trailing zeros until the requested number of decimals are reached. For instance, 1.2 with 2 decimals is shown as 1.20.
Maximum number of decimals
The formatted number is rounded to the given number of decimals. For instance, 1.2599 with 2 decimals is shown as 1.26.
Group separator
The separator between groups of integer digits. For instance, 1,000,000 uses a comma and 1'000'000 uses an apostrophe as group separator.
Decimal separator
The decimal separator to use.
Always show decimal separator
Whether to always show the decimal separator, even if no fractional digits are present. If enabled, output like 1. can be generated. If disabled, the same input would be shown as 1.

Input Ports

Icon
DB Data to define bins for.

Output Ports

Icon
DB Data with bins defined.
Icon
PMML document containing binning parameters.

Popular Predecessors

  • No recommendations found

Popular Successors

  • No recommendations found

Views

This node has no views

Workflows

  • No workflows found

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.