Extraction

Nodes for extracting various kind of information mainly from unstructured text.

This category contains 12 nodes.

Column Distance 

Calculates the distance between two input columns.

Corpus Creator 

Creates a corpus which contains the counts of tokens of a document collection.

Date Extractor Deprecated

Extracts date and time from text.

Date Extractor 

Extracts date and time from text.

Hash Calculator Deprecated

Calculate hash values for strings.

Hash Calculator 

Calculate hash values for strings and binary data.

N-Gram Extractor Streamable

Create n-grams for a given string.

Palladian NER 

Named Entity Recognizer from Palladian.

Regex Extractor 

Extract fragments from text using regular expressions.

String Similarity 

Calculate similarities between strings.