Clean potentially broken .rdf files (usually "empty" reactions) stemming e.g. from a Reaxys or Scifinder export.
Also prevents error from Erlwood Chem File Reader (note: this file reader is only available up to V4.2.x).
Therefore, a csv files of the rdf contents is also created.
Update Version 20221222. Spresi output support. Some code cleaning.
N.b.: Spresi contains mol based structures in fields other than the reaction as well, these are not converted into the csv.
Works in all Knime V4.x, not tested in 3.x
Uses a Python node, i.e. Python installation required.
Python requires Rdkit in your Python installation.
Output contains columsn with filenames of newly created fixed RDF and resulting CSV files.
Number of columns may vary, this is due to nature of the RDF data.
Even if you don't have or use Erlwood, the resulting CSV files may be parsed & used elsewhere & independently.
Update: Now includes a minimal mol sanitization else errors prevent the csv creation and crashes all, even though the rdf itself does get fixed.
To use this workflow in KNIME, download it from the below URL and open it in KNIME:
Download WorkflowDeploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!Do you have feedback, questions, comments about NodePit, want to support this platform, or want your own nodes or workflows listed here as well? Do you think, the search results could be improved or something is missing? Then please get in touch! Alternatively, you can send us an email to mail@nodepit.com.
Please note that this is only about NodePit. We do not provide general support for KNIME — please use the KNIME forums instead.