SMARTCyp 2.4.2

This Node Is Deprecated — This version of the node has been replaced with a new and improved version. The old version is kept for backwards-compatibility, but for all new workflows we suggest to use the version linked below.
Go to Suggested ReplacementSMARTCyp 2.4.2

This node will run an encapsulated SMARTCyp 2.4.2 jar. The results generated should be the same as those generated by the web service ( and SMARTCyp 2.4.2 jar. To allow for this the CDK adapter cell is generated, a molfile String is then produced and passed into SMARTCyp which will then use the older 1.4.x CDK for calculations. Refactoring efforts to generate a node running CDK 1.5.12 resulting in changed to the output results, use this node to generate 2.4.2 based results.

About SMARTCyp: Description taken from:

SMARTCyp is a method for prediction of which sites in a molecule that are most liable to metabolism by Cytochrome P450. It has been shown to be applicable to metabolism by the isoforms 1A2, 2A6, 2B6, 2C8, 2C19, 2E1, and 3A4 (CYP3A4), and specific models for the isoform 2C9 (CYP2C9) and isoform 2D6 (CYP2D6) are included from version 2.1. CYP3A4, CYP2D6, and CYP2C9 are the three of the most important enzymes in drug metabolism since they are involved in the metabolism of more than half of the drugs used today.

SMARTCyp is Developed by the Deparment of Drug Design and Pharmacology at the University of Copenhagen and is funded by Lhasa Limited. More details can be found at:

Notes on output: If the energy is 999, then there is no matching energy rule. Such sites should not be considered as possible sites of metabolism.

Score, S = E - 8*A - 0.04*SASA

Energy = E
E is an approximate activation energy for the reaction of the catalytic site of a CYP with the molecule at this atom. It is decided by fragment matching of each atom against a lookup table with SMARTS rules and activation energies in kJ/mol.

Accessibility = A
The accessibility is a relative measure of the topological distance for an atom from the center of the molecule, and is always a number between 0.5 (atom at the center) and 1 (atom at the end).

Solvent Accessible Surface Area = SASA The SASA describes the local accessibility of an atom and is computed using the 2DSASA algorithm which predicts this value from the molecular topology (for an exact value 3D coordinates would be necessary).

Known limitations: Since the SMARTCyp method relies heavily on reactivity there are some specific types of sites which often are ranked too high or too low. Sites with very low 3D accessibility are ranked too high Sites which are found as metabolites only due to entropy are ranked too low (for example tertbutyl groups which have nine identical hydrogen atoms). For really large compounds (more than 40 non-hydrogen atoms in CYP3A4) the reactive sites found by SMARTCyp are usually not the experimentally found metabolites. This is probably because the sites of metabolism for such large compounds depend heavily on the binding conformation.

Additional information:

  • Citing SMARTCyp
  • Interpreting results:


Standard Options

The column containing the structure. Choose a column compatible with the CDK adapter value (Smiles, Mol, CDK....)
Index from 0
The SMARTCyp application outputs the atom index starting at 1. The original version of this node was consistent with this approach. To allow easier integration with structure highlighters (which assume indexing from 0) an option is now available to output using a 0 based atom index.
The selected column will be carried over into the output table and used as the identifier for an input row on outputs were multiple rows per input row are created. If no ID column exists use RowID
N Oxidation Correction
Use the new empirical N-oxidation correction implemented by default in version 2.3 and forward. Default true.
Ranking cutoff
Only keep the x highest ranked atoms (duplicate positions kept) for atom highlighting.
Filter detailed
Whether the detailed output table should also be filtered by the ranking cutoff.

Image settings

The width of the result render
The height of the result render
Image format
Options available for rendering PNG and SVG cells using the CDK Depict functionality
The colour to use for all site highlights (regardless of rank).
Overide default colour
If checked the colour selected will be used for highlighted instead of the default rank colours.

Input Ports

A table containing a structure column compatible with CDK adapter cells

Output Ports

A summary of the SMARTCyp predicted sites for the Standard, CYP2C and CYP2D6 models.
This output table details each site(to the selected cutoff). The ID column used will allow grouping of the results to the desired input molecule.


This node has no views


  • No workflows found



You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.