The purpose of this flow is to identify duplicate invoices.
We prepare a set of invoice pairs, each labelled as duplicate or non-duplicate.
For each pair of invoices, we calculate the following features:
- The absolute difference between the two invoice amounts,
- The absolute difference between the two invoice dates,
- The Levenshtein distance between the two invoice numbers,
- The distance between the BERT vectors of the two descriptions.
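The four features above can be sketched in plain Python outside KNIME. This is only an illustration, not the workflow's actual nodes: the field names (`amount`, `date`, `number`, `embedding`) are hypothetical, the date difference is assumed to be measured in days, the BERT vectors are assumed to be precomputed lists of floats, and cosine distance is an assumed choice because the description does not say which vector distance is used.

```python
from datetime import date
import math


def levenshtein(a: str, b: str) -> int:
    """Edit distance between two strings (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution
        prev = curr
    return prev[-1]


def cosine_distance(u, v) -> float:
    """1 - cosine similarity; assumes neither vector is all zeros."""
    dot = sum(x * y for x, y in zip(u, v))
    norm_u = math.sqrt(sum(x * x for x in u))
    norm_v = math.sqrt(sum(y * y for y in v))
    return 1.0 - dot / (norm_u * norm_v)


def pair_features(inv_a: dict, inv_b: dict) -> list:
    """Build the four pairwise features for one pair of invoices."""
    return [
        abs(inv_a["amount"] - inv_b["amount"]),              # amount difference
        abs((inv_a["date"] - inv_b["date"]).days),           # date difference in days (assumed unit)
        levenshtein(inv_a["number"], inv_b["number"]),       # invoice number distance
        cosine_distance(inv_a["embedding"], inv_b["embedding"]),  # description distance
    ]


# Hypothetical invoices; the embeddings stand in for precomputed BERT vectors.
a = {"amount": 120.50, "date": date(2023, 3, 1), "number": "INV-1001",
     "embedding": [0.2, 0.7, 0.1]}
b = {"amount": 120.50, "date": date(2023, 3, 4), "number": "INV-1007",
     "embedding": [0.2, 0.7, 0.1]}
features = pair_features(a, b)
```

Small values across all four features suggest a likely duplicate; the model is what decides how to weigh them against each other.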
Next, we train a neural network model on this dataset.
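As a rough sketch of what training a neural network on such pairwise features involves, the following pure-Python example fits a very small network to toy data. Everything here is an assumption for illustration: the `TinyMLP` class, the architecture (one hidden tanh layer, sigmoid output), the learning rate, and the feature values, which are made up and assumed to be scaled to the range 0 to 1. The workflow itself uses KNIME nodes, and its actual network settings are not stated in this description.

```python
import math
import random


def sigmoid(z: float) -> float:
    return 1.0 / (1.0 + math.exp(-z))


class TinyMLP:
    """One hidden tanh layer and a sigmoid output, trained with per-sample SGD."""

    def __init__(self, n_in: int, n_hidden: int, seed: int = 0):
        rng = random.Random(seed)
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
                   for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
        self.b2 = 0.0

    def forward(self, x):
        h = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
             for row, b in zip(self.w1, self.b1)]
        p = sigmoid(sum(w * hi for w, hi in zip(self.w2, h)) + self.b2)
        return h, p

    def predict_proba(self, x) -> float:
        """Estimated probability that the pair is a duplicate."""
        return self.forward(x)[1]

    def train_step(self, x, y, lr: float) -> None:
        h, p = self.forward(x)
        d_out = p - y  # gradient of cross-entropy w.r.t. the output pre-activation
        for j, hj in enumerate(h):
            d_hid = d_out * self.w2[j] * (1.0 - hj * hj)  # backprop through tanh
            self.w2[j] -= lr * d_out * hj
            for i, xi in enumerate(x):
                self.w1[j][i] -= lr * d_hid * xi
            self.b1[j] -= lr * d_hid
        self.b2 -= lr * d_out


def mean_loss(model, X, y) -> float:
    """Mean binary cross-entropy over the dataset."""
    eps = 1e-12
    total = 0.0
    for xi, yi in zip(X, y):
        p = model.predict_proba(xi)
        total -= yi * math.log(p + eps) + (1 - yi) * math.log(1 - p + eps)
    return total / len(X)


# Made-up, pre-scaled features: [amount diff, date diff, number distance, text distance]
X = [[0.0, 0.0, 0.1, 0.05], [0.0, 0.1, 0.0, 0.1], [0.05, 0.0, 0.1, 0.0],   # duplicates
     [0.9, 0.8, 0.7, 0.9], [0.6, 1.0, 0.9, 0.8], [1.0, 0.7, 0.8, 0.6]]     # non-duplicates
y = [1, 1, 1, 0, 0, 0]

model = TinyMLP(n_in=4, n_hidden=3)
initial_loss = mean_loss(model, X, y)
for _ in range(500):
    for xi, yi in zip(X, y):
        model.train_step(xi, yi, lr=0.1)
final_loss = mean_loss(model, X, y)
```

In practice the raw features would need scaling before training, since amount differences and edit distances live on very different ranges.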
To use this workflow, download it from the link below and open it in KNIME:
Download Workflow