Generates a Forge™ Machine Learning regression or classification model for activity from a set of aligned molecules.
The molecules must be pre-aligned for all model types except using 2D descriptors (see below): the falign program or Forge Align node are ideal for this.
The following types of models can be generated.
k Nearest Neighbor (kNN) regression or classification
The kNN methodology is a well-known and robust machine learning approach where the activity for each compound is predicted as the weighted average activity of its k nearest neighbors (most similar compounds) in the training set.
The similarity between the molecules is calculated using either Cresset's field/shape similarity or by using 2D circular fingerprints (ECFP4, ECFP6, FCFP4, or FCFP6).
Random Forest (RF) regression
The aligned training set of molecules are used to derive a set of sampling points around the molecules based on their field points, which can be used to probe any molecule for the electrostatic potential or for the volume taken up by molecules. The data matrix derived from sample values is then processed to generate a Random Forest model.
Relevance Vector Machine (RVM) regression or classification
The aligned training set of molecules are used to derive a set of sampling points around the molecules based on their field points, which can be used to probe any molecule for the electrostatic potential or for the volume taken up by molecules. The data matrix derived from sample values is then processed to generate a RVM model.
Support Vector Machine (SVM) regression or classification
The aligned training set of molecules are used to derive a set of sampling points around the molecules based on their field points, which can be used to probe any molecule for the electrostatic potential or for the volume taken up by molecules. The data matrix derived from sample values is then processed to generate a SVM model.
These models can be used within the 'Forge Score Machine Learning' node (wrapping the 'fscore' executable) to predict an activity value for newly designed molecules.
Please refer to the Forge manual for a detailed description of the science behind each of these model types in Forge and the corresponding model building options.
This node wraps the Forge Build executable 'fbuild', which must be installed with a valid license for this node to work. If this is installed in the default location on Windows, then it should be found automatically. Otherwise, you must either set the 'Cresset Home' preference or the CRESSET_HOME environment variable to the base Cresset software install directory. You may also set the 'fbuild Path' preference or the CRESSET_FORGEBUILD_EXE environment variable to point directly at the executable itself.
The Forge Build Machine Learning node can be configured to use additional resources to perform calculations. The time taken for the node to run will be drastically reduced using the Cresset's Engine Broker. To use this facility either set the 'Cresset Engine Broker' preference or the CRESSET_BROKER environment variable to point to the location of your local Engine Broker. If you do not currently have the Cresset Engine Broker then contact Cresset (enquiries@cresset-group.com) for pricing on local and cloud based brokers.
For more information visit www.cresset-group.com or contact us at support@cresset-group.com.
You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.
To use this node in KNIME, install the extension Cresset KNIME Nodes from the below update site following our NodePit Product and Node Installation Guide:
A zipped version of the software site can be downloaded here.
Deploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!Do you have feedback, questions, comments about NodePit, want to support this platform, or want your own nodes or workflows listed here as well? Do you think, the search results could be improved or something is missing? Then please get in touch! Alternatively, you can send us an email to mail@nodepit.com.
Please note that this is only about NodePit. We do not provide general support for KNIME — please use the KNIME forums instead.