Icon

DataPrep

This directory contains 4 workflows.

Icon01_​Fetch_​BioAssays 

This is the first workflow in the PubChem Big Data story. In the top part of the workflow we download the assay data from the PubChem database using its […]

Icon02_​Pivot_​PubChemData 

This is the second workflow in the PubChem Big Data story. In the top part of the workflow we pivot the assay data using KNIME Extension for Apache […]

Icon03_​Fetch_​SMILES 

This is the third workflow in the PubChem Big Data story. First, we obtain the SMILES of the necessary CIDs using PubChem REST services. Then, we use […]

Icon04_​Generate_​Features 

This is the forth workflow in the PubChem Big Data story. We prepare three datasets for the machine learning experiments. Set 1: Compounds, their […]