Extraction

Nodes for extracting various kinds of information, mainly from unstructured text.

This category contains 16 nodes.

Column Distance Streamable Deprecated

Calculates the distance between two input columns.

Column Distance Streamable

Calculates the distance between two input columns.

Corpus Creator Streamable

Creates a corpus which contains the counts of tokens of a document collection.

Date Extractor Streamable

Extracts date and time from text.

Date Extractor Deprecated

Extracts date and time from text.

Empty String to Missing Value Streamable

Replaces empty strings with missing values.

Hash Calculator Streamable

Calculates hash values for strings and binary data.
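
As a rough illustration of what hashing strings and binary data means, here is a minimal Python sketch. It is not the node's implementation: the function name `hash_value`, the UTF-8 encoding of strings, and the choice of algorithms are assumptions made for this example only.

```python
import hashlib


def hash_value(data, algorithm="sha256"):
    """Return the hex digest of a string or bytes value.

    Hypothetical helper for illustration; strings are assumed
    to be encoded as UTF-8 before hashing.
    """
    if isinstance(data, str):
        data = data.encode("utf-8")
    return hashlib.new(algorithm, data).hexdigest()


if __name__ == "__main__":
    print(hash_value("abc", "md5"))
    print(hash_value(b"abc", "sha256"))
```

With this encoding assumption, a string and its UTF-8 byte representation produce the same hash value.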

Hash Calculator Deprecated

Calculates hash values for strings.

N-Gram Extractor Streamable

Creates n-grams for a given string.
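
For readers unfamiliar with n-grams, the following Python sketch shows the general idea. The listing does not say whether the node works on characters or on words, so both variants are shown; the function names `char_ngrams` and `word_ngrams` and the whitespace tokenization are assumptions for this example, not the node's behavior.

```python
def char_ngrams(text, n):
    """Return all contiguous character sequences of length n."""
    if n < 1:
        raise ValueError("n must be at least 1")
    return [text[i:i + n] for i in range(len(text) - n + 1)]


def word_ngrams(text, n):
    """Return all contiguous word sequences of length n.

    Words are split on whitespace, which is an assumption of this sketch.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    words = text.split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


if __name__ == "__main__":
    print(char_ngrams("text", 2))                   # ['te', 'ex', 'xt']
    print(word_ngrams("extract dates from text", 2))
```

An input shorter than n yields an empty list in both variants.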

Palladian NER 

Named Entity Recognizer from Palladian.