Icon

TeachOpenCADD_​Workflow4_​Similarity_​search

TeachOpenCADD Workflow 4: Ligand-based screening: Compound similarity

In virtual screening (VS), compounds similar to known ligands of a target under investigation often build the starting point for drug development. This approach follows the similar property principle stating that structurally similar compounds are more likely to exhibit similar biological activities (exceptions are so-called activity cliffs). For computational representation and processing, compound properties can be encoded in form of bit arrays, so-called molecular fingerprints, e.g. MACCS and Morgan fingerprints. Compound similarity can be assessed by measures such as the Tanimoto and Dice similarity.
This workflow shows how to use these encodings and comparison methods. VS is here conducted based on a similarity search.

4. Ligand-based screening: compound similarityIn virtual screening (VS), compounds similar to known ligands of a target under investigation often build the startingpoint for drug development. This approach follows the similar property principle stating that structurally similarcompounds are more likely to exhibit similar biological activities (exceptions are so-called activity cliffs). Forcomputational representation and processing, compound properties can be encoded in form of bit arrays, so-calledmolecular fingerprints, e.g. MACCS and Morgan fingerprints. Compound similarity can be assessed by measures suchas the Tanimoto and Dice similarity. The following steps show how to use these encodings and comparison methods.VS is here conducted based on a similarity search. Step 2Similarity search of query compound (example: gefitinib) against full dataset using Tanimoto/Dice similarity Step 1Calculate fingerprints for datasetand query compound Step 3Evaluate performance with enrichmentplots (split dataset into active andinactive compounds at pIC50 = 6.3) MACC fingerprints Morgan fingerprints This workflow is part of theTeachOpenCADD pipeline: https://hub.knime.com/volkamerlab/space/TeachOpenCADDRead more on the theoreticalbackground of this workflow on ourTeachOpenCADD platform: https://projects.volkamerlab.org/teachopencadd/talktorials/T004_compound_similarity.html DatasetDatasetTanimoto similarityDice similarityTanimoto similarityDice similarityTanimoto similarityDice similarityQueryQueryGefitinibSimilarity to queryList of compoundsList of compoundsDatasetDatasetTanimoto similarityNode 293Node 294RDKit Fingerprint RDKit Fingerprint Similarity Search Similarity Search Similarity Search Similarity Search Column Rename Column Rename Column Rename Column Rename EnrichmentPlotter (local) EnrichmentPlotter (local) RDKit Fingerprint RDKit Fingerprint Molecule Type Cast RDKit From Molecule Query compound Scatter plot CSV Reader Joiner Joiner Joiner CSV Reader Molecule Type Cast RDKit From Molecule RDKit Fingerprint RDKit Fingerprint Similarity Search Similarity Matrix(from Molecules) Smiles Reader 4. Ligand-based screening: compound similarityIn virtual screening (VS), compounds similar to known ligands of a target under investigation often build the startingpoint for drug development. This approach follows the similar property principle stating that structurally similarcompounds are more likely to exhibit similar biological activities (exceptions are so-called activity cliffs). Forcomputational representation and processing, compound properties can be encoded in form of bit arrays, so-calledmolecular fingerprints, e.g. MACCS and Morgan fingerprints. Compound similarity can be assessed by measures suchas the Tanimoto and Dice similarity. The following steps show how to use these encodings and comparison methods.VS is here conducted based on a similarity search. Step 2Similarity search of query compound (example: gefitinib) against full dataset using Tanimoto/Dice similarity Step 1Calculate fingerprints for datasetand query compound Step 3Evaluate performance with enrichmentplots (split dataset into active andinactive compounds at pIC50 = 6.3) MACC fingerprints Morgan fingerprints This workflow is part of theTeachOpenCADD pipeline: https://hub.knime.com/volkamerlab/space/TeachOpenCADDRead more on the theoreticalbackground of this workflow on ourTeachOpenCADD platform: https://projects.volkamerlab.org/teachopencadd/talktorials/T004_compound_similarity.html DatasetDatasetTanimoto similarityDice similarityTanimoto similarityDice similarityTanimoto similarityDice similarityQueryQueryGefitinibSimilarity to queryList of compoundsList of compoundsDatasetDatasetTanimoto similarityNode 293Node 294RDKit Fingerprint RDKit Fingerprint Similarity Search Similarity Search Similarity Search Similarity Search Column Rename Column Rename Column Rename Column Rename EnrichmentPlotter (local) EnrichmentPlotter (local) RDKit Fingerprint RDKit Fingerprint Molecule Type Cast RDKit From Molecule Query compound Scatter plot CSV Reader Joiner Joiner Joiner CSV Reader Molecule Type Cast RDKit From Molecule RDKit Fingerprint RDKit Fingerprint Similarity Search Similarity Matrix(from Molecules) Smiles Reader

Nodes

Extensions

Links