Database Sampling

This Node Is Deprecated — This version of the node has been replaced with a new and improved version. The old version is kept for backwards-compatibility, but for all new workflows we suggest to use the version linked below.
Go to Suggested ReplacementDB Row Sampling

This node is part of the deprecated database framework. For more information on how to migrate to the new database framework see the migration section of the database documentation.

This node extracts a sample (a bunch of rows) from the input data of a database. The dialog enables you to specify the sample size. The following options are available in the dialog:

Options

Absolute
Specify the absolute number of rows in the sample. If there are less rows than specified here, all rows are used.
Relative
The percentage of the number of rows in the sample. Must be between 0 and 100, inclusively.
Take from top
This mode selects the top most rows of the table. Note that this depends on the implementation of the connected database.
Draw randomly
Random sampling of all rows if connected database will support random sampling. Note that this method might be very slow for large database tables.
Stratified sampling
Check this button if you want stratified sampling, i.e. the distribution of values in the selected column is (approximately) retained in the output table.

Input Ports

Icon
Table in database to apply database sampling

Output Ports

Icon
Table in the database with sampled rows

Views

This node has no views

Workflows

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.