Extraction

Nodes for extracting various kinds of information, mainly from unstructured text.

This category contains 13 nodes.

Column Distance Streamable

Calculates the distance between two input columns.

Corpus Creator Streamable

Creates a corpus which contains the counts of tokens of a document collection.
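As a rough illustration of what counting tokens across a document collection means, here is a minimal Python sketch. It is not the node's implementation: the function name `token_counts`, the lowercasing, and the whitespace tokenization are all assumptions made for the example.

```python
from collections import Counter


def token_counts(documents):
    """Count how often each token occurs across all documents.

    Assumption: tokens are lowercased, whitespace-separated words.
    The actual node may tokenize differently.
    """
    counts = Counter()
    for doc in documents:
        counts.update(doc.lower().split())
    return counts


if __name__ == "__main__":
    corpus = token_counts(["the cat sat", "The dog sat down"])
    print(corpus.most_common(2))
```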

Date Extractor Streamable

Extracts date and time from text.

Date Extractor Deprecated

Extracts date and time from text.

Hash Calculator Streamable

Calculates hash values for strings and binary data.
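To make the idea concrete, the following Python sketch hashes either a string or raw bytes with the standard `hashlib` module. The helper name `hash_value`, the UTF-8 encoding of strings, and the SHA-256 default are assumptions for illustration; the node's own options and algorithms may differ.

```python
import hashlib


def hash_value(data, algorithm="sha256"):
    """Return the hex digest of a string or bytes value.

    Assumption: strings are encoded as UTF-8 before hashing.
    """
    if isinstance(data, str):
        data = data.encode("utf-8")
    return hashlib.new(algorithm, data).hexdigest()


if __name__ == "__main__":
    print(hash_value("abc"))            # string input
    print(hash_value(b"\x00\x01\x02"))  # binary input
```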

Hash Calculator Deprecated

Calculates hash values for strings.

N-Gram Extractor Streamable

Creates n-grams for a given string.
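For readers unfamiliar with n-grams, the sketch below shows character-level and word-level variants in Python. The function names `char_ngrams` and `word_ngrams` are hypothetical, and splitting words on whitespace is an assumption; whether the node offers both modes is not stated here.

```python
def char_ngrams(text, n):
    """Return all contiguous character sequences of length n."""
    if n <= 0:
        raise ValueError("n must be positive")
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def word_ngrams(text, n):
    """Return all contiguous word sequences of length n.

    Assumption: words are separated by whitespace.
    """
    if n <= 0:
        raise ValueError("n must be positive")
    tokens = text.split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


if __name__ == "__main__":
    print(char_ngrams("text", 2))                 # ['te', 'ex', 'xt']
    print(word_ngrams("the quick brown fox", 2))  # ['the quick', 'quick brown', 'brown fox']
```

A string shorter than `n` yields an empty list in both functions.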

Palladian NER

Named Entity Recognizer from Palladian.

Regex Extractor Deprecated

Extracts fragments from text using regular expressions.

Regex Extractor Streamable

Extracts fragments from text using regular expressions.
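The general technique of regex-based extraction can be sketched in Python with the standard `re` module. The helper name `extract_fragments` and the sample date pattern are assumptions for illustration. The node itself is configured in KNIME and may use a different regex dialect.

```python
import re


def extract_fragments(text, pattern):
    """Return every non-overlapping match of pattern in text, in order."""
    return [match.group(0) for match in re.finditer(pattern, text)]


if __name__ == "__main__":
    sample = "Shipped on 2024-01-05, delivered on 2024-01-09."
    # Hypothetical pattern: ISO-style dates (YYYY-MM-DD).
    print(extract_fragments(sample, r"\d{4}-\d{2}-\d{2}"))
    # ['2024-01-05', '2024-01-09']
```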