
Row Sampling

KNIME Base Nodes version 4.0.0.v201906241150 by KNIME AG, Zurich, Switzerland

This node extracts a sample (a subset of rows) from the input data. In the dialog you specify the sample size and how the rows are selected. The following options are available:

Options

Absolute
Specify the absolute number of rows in the sample. If there are fewer rows than specified here, all rows are used.
Relative
The sample size as a percentage of the number of input rows. Must be between 0 and 100, inclusive.
Take from top
This mode selects the topmost rows of the table.
Linear sampling
This mode always includes the first and the last row and selects the remaining rows linearly over the whole table (e.g. every third row). This is useful for downsampling a sorted column while retaining its minimum and maximum values.
Draw randomly
Random sampling of all rows. You may optionally specify a fixed seed (see below).
Stratified sampling
Select this option for stratified sampling, i.e. the distribution of values in the selected column is (approximately) retained in the output table. You may optionally specify a fixed seed (see below).
Use random seed
If either random or stratified sampling is selected, you may enter a fixed seed here in order to get reproducible results upon re-execution. If you do not specify a seed, a new random seed is taken for each execution.
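
The options above can be illustrated with a short Python sketch. This is not KNIME's implementation: the function names (resolve_sample_size, linear_sample, stratified_sample), the rounding rule, and the per-group quota calculation are all assumptions chosen for illustration. It only shows how an absolute or relative size might be resolved, how linear sampling keeps the first and last row, and how a fixed seed makes stratified sampling reproducible.

```python
import random
from collections import defaultdict


def resolve_sample_size(n_rows, absolute=None, relative=None):
    """Turn an absolute count or a percentage (0-100) into a row count."""
    if absolute is not None:
        # Fewer rows than requested: all rows are used.
        return min(absolute, n_rows)
    if relative is None or not 0 <= relative <= 100:
        raise ValueError("relative must be between 0 and 100, inclusive")
    # Rounding rule is an assumption; the node may round differently.
    return round(n_rows * relative / 100)


def linear_sample(rows, k):
    """Pick k rows spread evenly over the table, including both ends."""
    n = len(rows)
    if k >= n:
        return list(rows)
    if k <= 0:
        return []
    if k == 1:
        return [rows[0]]
    # Evenly spaced positions from index 0 to n - 1, both ends included.
    indices = [round(i * (n - 1) / (k - 1)) for i in range(k)]
    return [rows[i] for i in indices]


def stratified_sample(rows, key, k, seed=None):
    """Draw about k rows so that each value of key(row) keeps its share."""
    # A fixed seed gives the same sample on every run;
    # seed=None gives a fresh random sample each time.
    rng = random.Random(seed)
    groups = defaultdict(list)
    for row in rows:
        groups[key(row)].append(row)
    n = len(rows)
    sample = []
    for members in groups.values():
        # Per-group quota proportional to the group's share of the table.
        # Rounding means the total can differ slightly from k.
        quota = min(round(k * len(members) / n), len(members))
        sample.extend(rng.sample(members, quota))
    return sample


if __name__ == "__main__":
    table = [("a", i) for i in range(80)] + [("b", i) for i in range(20)]
    size = resolve_sample_size(len(table), relative=10)
    print(linear_sample(list(range(10)), 4))
    print(stratified_sample(table, key=lambda r: r[0], k=size, seed=42))
```

In this sketch the stratified output is grouped by class rather than kept in input order, and its size is only approximately k, which mirrors the "(approximately) retained" wording above. Whether the node itself preserves row order is not stated here.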

Input Ports

Table to sample from.

Output Ports

The sampled table.

Installation

To use this node in KNIME, install KNIME Base Nodes from the following update site:

KNIME 4.0