Icon

CI_​IA_​AP-Duplicate_​Invoice_​Detection

The purpose of this flow is to identify duplicate invoices.
We prepare a set of invoices labelled as duplicates or non-duplicates.
We have calculated the absolute value of the difference between:
- The amounts of two invoices,
- The dates of two invoices,
- The Levenshtein distance between the invoice numbers,
- The distance between the two BERT vectors of the descriptions.
Next, we create a model trained on this dataset based on the neural network.

Data Preprocessing Training Model evaluation and export Data Preprocessing Deploying Reporting Deploying Training extract prediction Excel Reader Excel Reader Keras NetworkLearner Keras NetworkExecutor Scorer Rule Engine Partitioning Keras NetworkWriter DataPreparation Keras NetworkReader Keras NetworkExecutor Data Preparation Excel Writer Column Filter Networkarchitecture Data Preprocessing Training Model evaluation and export Data Preprocessing Deploying Reporting Deploying Training extract prediction Excel Reader Excel Reader Keras NetworkLearner Keras NetworkExecutor Scorer Rule Engine Partitioning Keras NetworkWriter DataPreparation Keras NetworkReader Keras NetworkExecutor Data Preparation Excel Writer Column Filter Networkarchitecture

Nodes

Extensions

Links