Icon

JKISeason3-13_​tark

Detecting Fraudulent Contracts

Level: Easy to Medium

Description: You work in the contracts department of a software company and are asked to detect fraudulent (or wrong) contracts based on their contract value. Given the PDF versions of the contracts, you need to extract their contract value (and, optionally, any other fields you find useful) and detect outliers among them. You can either use simpler outlier detection techniques, such as those based on statistics or visualization, or more advanced ones based on machine learning.

Author: Lada Rudnitckaia

Dataset: Contract data in the KNIME Community Hub

URL: Datasets https://hub.knime.com/alinebessa/spaces/Just%20KNIME%20It!%20Season%203%20-%20Datasets/Challenge%2013%20-%20Dataset~PY1cXhvh92p6MY-E/
URL: JIKISeason3-13 https://www.knime.com/just-knime-it?pk_vid=f1a9625dd14a14c5171698895027e10b
URL: this challenge thread https://forum.knime.com/t/solutions-to-just-knime-it-challenge-13-season-3/81919

Node 1PDF inthe current workflowdata area folderKNME URLNode 5Split the stringsat line breaks (\n)Split Arr[17]to extractthe product names Split Arr[25]to extractthe contract valuesNode 10Group by the product namesNode 14Node 15Node 16Node 20Split Arr[1]to extractthe investmentagreementNode 22Detect outlierTika Parser List Files/Folders Path to String Table Rowto Variable Cell Splitter Regex Split Regex Split String to Number Group Loop Start Loop End Conditional BoxPlot (JavaScript) Column Renamer Table View Regex Split Joiner Metanode Node 1PDF inthe current workflowdata area folderKNME URLNode 5Split the stringsat line breaks (\n)Split Arr[17]to extractthe product names Split Arr[25]to extractthe contract valuesNode 10Group by the product namesNode 14Node 15Node 16Node 20Split Arr[1]to extractthe investmentagreementNode 22Detect outlierTika Parser List Files/Folders Path to String Table Rowto Variable Cell Splitter Regex Split Regex Split String to Number Group Loop Start Loop End Conditional BoxPlot (JavaScript) Column Renamer Table View Regex Split Joiner Metanode

Nodes

Extensions

Links