Classification

Nodes for building dictionary-based classifiers for text documents. Using a set of labeled sample documents, one can build a dictionary and use it to classify uncategorized documents. Typical use cases for text classification are e.g. automated email spam detection, language identification, or sentiment analysis.

This category contains 7 nodes.

Dataset Reader 

Reader for Palladian’s dataset format.

Text Classifier Learner Streamable

Learner for a dictionary-based text classifier for categorizing text documents.

Text Classifier Model Pruner 

Different pruning methods for Palladian text classifier models.

Text Classifier Model Reader 

This node allows the deserialization of a trained Text Classifier model.

Text Classifier Model to Table 

Converts a text classifier model to a table.

Text Classifier Model Writer 

This node allows serializing a trained Text Classifier model, so that it can be used programmatically within Palladian.

Text Classifier Predictor 

Predictor for a dictionary-based text classifier for categorizing text documents.