This node implements Presidio's Analyzer, which detects Personally Identifiable Information (PII) in English text data.
The node analyzes the data of a specified string column of the input table for specified PII entity types. It adds the detected entities to the input table by appending the following columns:
If several entities are detected in one row, that row is ungrouped into multiple rows so that each output row contains exactly one entity.
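The ungrouping step can be illustrated with a minimal Python sketch. It is not the node's actual implementation. The output column names (`entity_type`, `start`, `end`, `score`, `entity_text`), the helper names, and the choice to drop rows without any detection are all assumptions made for illustration. The regex detector is a stand-in so that the sketch runs without Presidio installed. The `presidio_detector` adapter shows how the `presidio_analyzer` package could be plugged in instead, and it imports the package only when called.

```python
import re
from typing import Callable, Dict, Iterable, List, NamedTuple


class Detection(NamedTuple):
    """One detected entity: its type, character span, and confidence score."""
    entity_type: str
    start: int
    end: int
    score: float


Detector = Callable[[str], List[Detection]]


def regex_email_detector(text: str) -> List[Detection]:
    """Stand-in detector (NOT Presidio): flags e-mail-like substrings."""
    pattern = re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+")
    return [
        Detection("EMAIL_ADDRESS", m.start(), m.end(), 1.0)
        for m in pattern.finditer(text)
    ]


def presidio_detector(entity_types: List[str]) -> Detector:
    """Adapter around Presidio's AnalyzerEngine (requires presidio_analyzer
    and an English spaCy model; imported lazily so the sketch runs without it)."""
    from presidio_analyzer import AnalyzerEngine

    engine = AnalyzerEngine()

    def detect(text: str) -> List[Detection]:
        results = engine.analyze(text=text, entities=entity_types, language="en")
        return [Detection(r.entity_type, r.start, r.end, r.score) for r in results]

    return detect


def ungroup(rows: Iterable[Dict], column: str, detect: Detector) -> List[Dict]:
    """Emit one output row per detected entity.

    Assumption: rows without any detection are dropped; the real node may
    handle them differently. Appended column names are hypothetical.
    """
    out = []
    for row in rows:
        text = row[column]
        for d in detect(text):
            out.append({
                **row,
                "entity_type": d.entity_type,
                "start": d.start,
                "end": d.end,
                "score": d.score,
                "entity_text": text[d.start:d.end],
            })
    return out


if __name__ == "__main__":
    table = [
        {"id": 1, "text": "Mail a@x.org or b@y.com"},
        {"id": 2, "text": "nothing here"},
    ]
    for r in ungroup(table, "text", regex_email_detector):
        print(r)
```

To try the sketch with Presidio itself, pass `presidio_detector(["EMAIL_ADDRESS", "PERSON"])` as the `detect` argument instead of the regex stand-in.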
Further information on the Presidio Analyzer can be found on the Microsoft Presidio website.
Warning: Presidio can help identify sensitive/PII data in structured and unstructured text. However, because it uses automated detection mechanisms, there is no guarantee that Presidio will find all sensitive information. Therefore, always evaluate the quality of the detections and take appropriate measures if necessary.
Select the string column that contains the data for PII detection.
Select the PII entity types that will be detected.
Available options:
To use this node in KNIME, install the KNIME Python Extension Development (Labs) extension from the update site below, following the NodePit Product and Node Installation Guide: