RDKit Diversity Picker

Picks diverse rows from an input table based on tanimoto distance between fingerprints. The picking is done using the MaxMin algorithm (Ashton, M. et. al., Quant. Struct.-Act. Relat., 21 (2002), 598-604). The algorithm is quite fast, even for large datasets, but note that runtime increases rapidly with the number of rows to be picked.

Options

Molecule or fingerprint column (table 1): The column containing the molecules or fingerprints to pick from. If molecules are selected their fingerprints will be calculated automatically with Morgan, Radius 2, 2048 bit length.
Molecule or fingerprint column to bias away from (table 2): The column containing molecules or fingerprints to bias away from. This option has the effect of seeding the diversity pick: Molecules selected will be diverse with respect to these biasing molecules as well as each other. If molecules are provided as input their fingerprints will be calculated automatically based on input of table 1. If table 1 has fingerprints with unknown settings this calculation will fail. In this case please regenerate fingerprints in table 1 with the RDKit Fingerprint Node or select a compatible fingerprint column in table 2 instead of a molecule column.
Number to pick: Number of diverse rows to pick.
Random seed: Random number seed to use.

Input Ports

: Table with either molecule or fingerprints for diversity picking
: Table with either molecules or fingerprints to bias away from

Output Ports

: The results of the diversity pick

Popular Predecessors

Popular Successors

Views

This node has no views

Workflows

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.

Installation

To use this node in KNIME, install the extension RDKit Nodes Tests Feature from the below update site following our NodePit Product and Node Installation Guide:

v5.5

Plugin provider: Novartis

Plugin version: 5.2.1.v202507231837

On NodePit since: 2025-07-02

Last update: 2025-07-31

KNIME versions: v5.5, v5.4, v5.3, v5.2, v5.1, v4.7, v4.6, v4.5, v4.4, v4.3, v4.2, v4.1, v4.0, v3.7, v3.6

Deploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.

Try NodePit Runner!