RDKit Diversity Picker

Picks diverse rows from an input table based on tanimoto distance between fingerprints. The picking is done using the MaxMin algorithm (Ashton, M. et. al., Quant. Struct.-Act. Relat., 21 (2002), 598-604). The algorithm is quite fast, even for large datasets, but note that runtime increases rapidly with the number of rows to be picked.

Options

Molecule or fingerprint column (table 1): The column containing the molecules or fingerprints to pick from. If molecules are selected their fingerprints will be calculated automatically with Morgan, Radius 2, 2048 bit length.
Molecule or fingerprint column to bias away from (table 2): The column containing molecules or fingerprints to bias away from. This option has the effect of seeding the diversity pick: Molecules selected will be diverse with respect to these biasing molecules as well as each other. If molecules are provided as input their fingerprints will be calculated automatically based on input of table 1. If table 1 has fingerprints with unknown settings this calculation will fail. In this case please regenerate fingerprints in table 1 with the RDKit Fingerprint Node or select a compatible fingerprint column in table 2 instead of a molecule column.
Number to pick: Number of diverse rows to pick.
Random seed: Random number seed to use.

Input Ports

: Table with either molecule or fingerprints for diversity picking.
: Table with either molecules or fingerprints to bias away from.

Output Ports

: The results of the diversity pick.

Popular Predecessors

Table Row to Variable4 %
Hierarchical Clustering2 %
Column Resorter2 %
File Reader2 %
Table Column to Variable2 %
Show all 155 recommendations

Popular Successors

2D/3D Scatterplot4 %
RDKit Canon SMILES2 %
SDF Writer2 %
MoSS MCSS Molecule Similarity2 %
Reference Row Filter2 %
Fingerprint Similarity1 %
CASE Switch Data (End)1 %
Excel Writer (XLS)1 %
Excel Writer1 %
RDKit Add Hs1 %
RDKit From Molecule1 %
RDKit Two Component Reaction1 %
Configurable End IF/CASE1 %
MolConverter1 %
Cell Replacer1 %
Partitioning1 %
Sorter1 %
Distance Matrix Calculate1 %
Similarity Search1 %
RDKit Chemical Transformation1 %
RDKit Find Murcko Scaffolds1 %
RDKit Molecule to SVG1 %
Lookup and Add Columns1 %
Similarity Matrix (from Molecules)1 %
CSV Writer1 %
Loop End1 %
Chunk Loop Start1 %
Random Forest Predictor1 %
Concatenate1 %
Column to Grid1 %
Constant Value Column1 %
Column Filter1 %
Row Filter1 %
Reference Row Splitter1 %
GroupBy1 %
Joiner1 %
String Manipulation1 %
Renderer to Image1 %
End IF1 %
IF Switch1 %
Scatter Plot (JFreeChart)1 %
Scatter Plot1 %
Fingerprint Similarity1 %
RDKit Descriptor Calculation1 %
RDKit Diversity Picker1 %
RDKit Interactive Table1 %
RDKit Fingerprint1 %
RDKit To InChI1 %
RDKit To Molecule1 %
RDKit Substructure Filter1 %
Principal Moment of Intertia (PMI)-Derived Properties< 1 %
Advanced MolConverter< 1 %
Activity Cliffs Viewer< 1 %
Loop End< 1 %
Joiner< 1 %
Rule-based Row Splitter< 1 %
Save Workflow< 1 %
MoSS< 1 %
RDKit Molecule Substructure Filter< 1 %
Polar Surface Area< 1 %
k-Means< 1 %
Cell Splitter< 1 %
Correlation Filter< 1 %
Interactive HiLite Collector< 1 %
Column Rename< 1 %
Shuffle< 1 %
Statistics< 1 %
Interactive Table< 1 %
RDKit Molecule Fragmenter< 1 %
RDKit RMSD Filter< 1 %
ActMolecule from Molecule< 1 %
~~Write MDB~~< 1 %
Canvas Molecular Descriptors< 1 %
PMI Triangle Scatter Plot< 1 %
Speedy SMILES Heavy Atom Count (HAC)< 1 %
MCS< 1 %
Similarity Viewer< 1 %
Variable to Table Column< 1 %
Hierarchical Clustering (DistMatrix)< 1 %
Molecule Type Cast< 1 %
Mol2 Writer< 1 %
Fingerprint Bayesian Learner< 1 %
Counter Generation< 1 %
Table to PDF< 1 %
Table to PDF< 1 %
HeatMap (JFreeChart)< 1 %
XLS Writer< 1 %
Table View< 1 %
JavaScript Table View< 1 %
Tile View< 1 %
RDKit Molecule Extractor< 1 %
Overlay Complexes< 1 %
Substructure Counter< 1 %
Write ASCII< 1 %
Spark Fragment Selector< 1 %
Torch/Forge Molecule Viewer< 1 %
Fingerprint Similarity< 1 %
R-Group Decomposer< 1 %
SDF Writer< 1 %
SMILES Writer< 1 %
Data to Report< 1 %
Canvas Fingerprint Generation< 1 %
Molecule-to-MAE< 1 %
Spotfire File Writer< 1 %
PMI Kernel Density Plot< 1 %
~~PMI Calculation~~< 1 %
Apply Transforms (RDKit) (Experimental)< 1 %
Chemical Terms< 1 %
MolExporter< 1 %
MarvinView< 1 %
CheS-Mapper< 1 %
Fingerprints Expander< 1 %
Catch Errors (Data Ports)< 1 %
CSV Writer (deprecated)< 1 %
Table Writer< 1 %
Recursive Loop End (2 ports)< 1 %
Naive Bayes Predictor< 1 %
Hierarchical Clustering< 1 %
PCA< 1 %
Random Forest Predictor (Regression)< 1 %
Concatenate (Optional in)< 1 %
Auto-Binner< 1 %
Column Auto Type Cast< 1 %
Number To String (deprecated)< 1 %
Column Resorter< 1 %
Linear Correlation< 1 %
Duplicate Row Filter< 1 %
Reference Column Filter< 1 %
Filter Apply< 1 %
HiLite Row Splitter< 1 %
Row Splitter< 1 %
Normalizer (Apply)< 1 %
Round Double< 1 %
RowID< 1 %
Row Sampling< 1 %
Set Operator< 1 %
Column Splitter< 1 %
Top k Selector< 1 %
Ungroup< 1 %
Value Counter< 1 %
Rule Engine (Dictionary)< 1 %
Data Explorer< 1 %
Histogram (interactive)< 1 %
Pie chart< 1 %
Scatter Plot (local)< 1 %
Scatter Matrix< 1 %
Color Manager< 1 %
Molfile Writer< 1 %
Smiles Directory Writer< 1 %
One Row to Many< 1 %
Histogram (JavaScript)< 1 %
OpenBabel< 1 %
Math Formula< 1 %
Excel Writer (XLS)< 1 %
Java Snippet (simple)< 1 %
Table Difference Checker< 1 %
Structure Converter< 1 %
CDK to Molecule< 1 %
2D Coordinates< 1 %
Depiction< 1 %
Lipinski's Rule-of-Five< 1 %
Fingerprints< 1 %
3D Viewer< 1 %
RDKit Generate Coords< 1 %
~~RDKit Fingerprint Writer~~< 1 %
~~RDKit Functional Group Filter~~< 1 %
RDKit Molecule Catalog Filter< 1 %
RDKit Open 3D Alignment< 1 %
RDKit Optimize Geometry< 1 %
RDKit Count-Based Fingerprint< 1 %
RDKit Salt Stripper< 1 %
ChEMBLdb Connector Input< 1 %

Views

This node has no views

Workflows

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.

Installation

To use this node in KNIME, install the extension RDKit Nodes Feature from the below update site following our NodePit Product and Node Installation Guide:

v5.12

Plugin provider: Novartis

Plugin version: 6.0.0.v202607101001

On NodePit since: 2026-07-06

Last update: 2026-08-01

Tags: Modern UI

KNIME versions: v5.12, v5.11, v5.10, v5.9, v5.8, v5.7, v5.6, v5.5, v5.4, v5.3, v5.2, v5.1, v4.7, v4.6, v4.5, v4.4, v4.3, v4.2, v4.1, v4.0, v3.7, v3.6

Deploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.

Try NodePit Runner!