Nodes for extracting various kinds of information, mainly from unstructured text.

This category contains 12 nodes.

Column Distance 

Calculates the distance between two input columns.

Corpus Creator 

Creates a corpus which contains the counts of tokens of a document collection.

Date Extractor 

Extracts date and time from text.

Date Extractor (Deprecated)

Extracts date and time from text.

Hash Calculator (Streamable)

Calculates hash values for strings and binary data.
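
The listing does not say which hash algorithms the node offers or how it encodes strings. As a rough illustration of the general idea only, the Python sketch below hashes both strings and binary data with the standard `hashlib` module. The function name `hash_value`, the SHA-256 default, and the UTF-8 encoding are assumptions made for this example, not the node's documented behavior.

```python
import hashlib


def hash_value(data, algorithm="sha256"):
    """Return the hex digest of a string or bytes value.

    Hypothetical helper: strings are encoded as UTF-8 first (an assumption),
    bytes are hashed as-is.
    """
    if isinstance(data, str):
        data = data.encode("utf-8")
    return hashlib.new(algorithm, data).hexdigest()


if __name__ == "__main__":
    print(hash_value("abc"))            # string input
    print(hash_value(b"abc"))           # binary input, same digest as above
    print(hash_value("abc", "md5"))     # a different algorithm
```

Because the string is encoded before hashing, the string and binary forms of the same content produce the same digest in this sketch.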

Hash Calculator (Deprecated)

Calculates hash values for strings.

N-Gram Extractor (Streamable)

Creates n-grams for a given string.
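
To make the term concrete: an n-gram is a run of n consecutive units (characters or words) taken from the input. The Python sketch below shows both variants. The function names and the whitespace tokenization are assumptions for illustration; the listing does not state which variants or options the node supports.

```python
def char_ngrams(text, n):
    """Return all character n-grams of text, in order of appearance."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def word_ngrams(text, n):
    """Return all word n-grams, splitting on whitespace (an assumption)."""
    if n < 1:
        raise ValueError("n must be at least 1")
    tokens = text.split()
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


if __name__ == "__main__":
    print(char_ngrams("text", 2))                 # ['te', 'ex', 'xt']
    print(word_ngrams("the quick brown fox", 2))  # ['the quick', 'quick brown', 'brown fox']
```

An input shorter than n yields an empty list in this sketch.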

Palladian NER 

Named Entity Recognizer from Palladian.

Regex Extractor 

Extracts fragments from text using regular expressions.
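
As a sketch of what regex-based extraction looks like in general, the Python example below collects every non-overlapping match of a pattern from a text. The helper name `extract_fragments` is hypothetical, and Python's `re` syntax may differ in details from the regex dialect the node uses.

```python
import re


def extract_fragments(text, pattern):
    """Return every non-overlapping match of pattern in text, left to right."""
    return [match.group(0) for match in re.finditer(pattern, text)]


if __name__ == "__main__":
    sample = "Order 12 shipped, order 345 pending"
    print(extract_fragments(sample, r"\d+"))  # ['12', '345']
```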

String Similarity (Streamable)

Calculates similarities between strings.
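
The listing does not name the similarity measures the node provides. One common choice, shown here purely as an assumed example, is a similarity derived from the Levenshtein edit distance and normalized to the range 0 to 1. The function names are hypothetical.

```python
def levenshtein(a, b):
    """Return the minimum number of single-character edits turning a into b."""
    previous = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, char_a in enumerate(a, 1):
        current = [i]
        for j, char_b in enumerate(b, 1):
            current.append(min(
                previous[j] + 1,                        # deletion
                current[j - 1] + 1,                     # insertion
                previous[j - 1] + (char_a != char_b),   # substitution or match
            ))
        previous = current
    return previous[-1]


def similarity(a, b):
    """Return 1.0 for identical strings, 0.0 when every position must change."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0  # two empty strings are treated as identical
    return 1.0 - levenshtein(a, b) / longest


if __name__ == "__main__":
    print(levenshtein("kitten", "sitting"))  # 3
    print(similarity("kitten", "sitting"))
```

Dividing by the longer string's length keeps the score comparable across string pairs of different sizes.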