Clean potentially broken .rdf files (usually "empty" reactions) stemming e.g. from a Reaxys or Scifinder export.
Also prevents error from Erlwood Chem File Reader (note: this file reader is only available up to V4.2.x).
Therefore, a csv files of the rdf contents is also created.
Update Version 20221222. Spresi output support. Some code cleaning.
N.b.: Spresi contains mol based structures in fields other than the reaction as well, these are not converted into the csv.
Works in all Knime V4.x, not tested in 3.x
Uses a Python node, i.e. Python installation required.
Python requires Rdkit in your Python installation.
Output contains columsn with filenames of newly created fixed RDF and resulting CSV files.
Number of columns may vary, this is due to nature of the RDF data.
Even if you don't have or use Erlwood, the resulting CSV files may be parsed & used elsewhere & independently.
Update: Now includes a minimal mol sanitization else errors prevent the csv creation and crashes all, even though the rdf itself does get fixed.
To use this workflow in KNIME, download it from the below URL and open it in KNIME:
Download WorkflowDeploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!