Icon

04.1. Integrated Deployment - solution

Integrated Deployment Solution

Solution to the "Integrated Deployment" exercise for the advanced Life Science User Training
- capture the data preprocessing
- capture the model prediction
- deploy the captured workflow

Activity I: Create Model for Deployment - capture the preprocessing of your data and the model prediction using the Capture Workflow Start and Capture Workflow End nodes - combine your captured workflows using the Workflow Combiner - write/deploy your workflow using the Workflow Writer node Integrated DeploymentThis workflow demonstrates integrated deployment for a Random Forest Model for a bioactivity data set using binary fingerprints.The data set represents a subset of 844 compounds evaluated for activity against CDPK1. 181 compounds inhibited CDPK1 with IC50 below 1uM and have "active" as their class.More information is available https://www.ebi.ac.uk/chemblntd/#tcams_dataset. See Set 19. Step 1: Capture Preprocessing- use the Capture Workflow Start node with data port types tostart capturing your preprocessing part of the workflow- use the RDKit Fingerprint node to create binary fingerprints. - use the Capture Workflow End node with data port types toend your preprocessing part of the workflow Step 2: Capture Prediction Model- use the Capture Workflow Start node with data port types to start capturingyour prediction model- use the Random Forest Predictor node for your model predictions- use the Column Filter node to remove your binary fingerprint column - use the Capture Workflow End node with data port types to end yourprediction model Step 3: Combine capturedWorkflows and Deploy- use the Workflow Combiner node tocombine the preprocessing workflow andthe model prediction workflow (HINT: thefirst part needs to be connected to theupper input port)- use the Workflow Writer node to deployyour workflow TCAMS_CDPK1_subset_ML.table Partitioning Random ForestLearner Random ForestPredictor RDKit Fingerprint Scorer (JavaScript) CaptureWorkflow Start CaptureWorkflow End Workflow Combiner CaptureWorkflow End CaptureWorkflow Start Column Filter Table Reader Workflow Writer Activity I: Create Model for Deployment - capture the preprocessing of your data and the model prediction using the Capture Workflow Start and Capture Workflow End nodes - combine your captured workflows using the Workflow Combiner - write/deploy your workflow using the Workflow Writer node Integrated DeploymentThis workflow demonstrates integrated deployment for a Random Forest Model for a bioactivity data set using binary fingerprints.The data set represents a subset of 844 compounds evaluated for activity against CDPK1. 181 compounds inhibited CDPK1 with IC50 below 1uM and have "active" as their class.More information is available https://www.ebi.ac.uk/chemblntd/#tcams_dataset. See Set 19. Step 1: Capture Preprocessing- use the Capture Workflow Start node with data port types tostart capturing your preprocessing part of the workflow- use the RDKit Fingerprint node to create binary fingerprints. - use the Capture Workflow End node with data port types toend your preprocessing part of the workflow Step 2: Capture Prediction Model- use the Capture Workflow Start node with data port types to start capturingyour prediction model- use the Random Forest Predictor node for your model predictions- use the Column Filter node to remove your binary fingerprint column - use the Capture Workflow End node with data port types to end yourprediction model Step 3: Combine capturedWorkflows and Deploy- use the Workflow Combiner node tocombine the preprocessing workflow andthe model prediction workflow (HINT: thefirst part needs to be connected to theupper input port)- use the Workflow Writer node to deployyour workflow TCAMS_CDPK1_subset_ML.table Partitioning Random ForestLearner Random ForestPredictor RDKit Fingerprint Scorer (JavaScript) CaptureWorkflow Start CaptureWorkflow End Workflow Combiner CaptureWorkflow End CaptureWorkflow Start Column Filter Table Reader Workflow Writer

Nodes

Extensions

Links