Dataset Reader

Go to Product

This node enables reading dataset in Palladian’s format, usually used for text classification. It consists of one index file, which contains a list of files referenced by the relative paths and an assigned class for each file, separated by an arbitrary separation character (usually a space).

Options

Index file
The index txt file containing a list of documents by their relative path to the index file and the assigned classes.
Separation string
The string or character used for separating the file reference from the class, usually a single space character.

Input Ports

This node has no input ports

Output Ports

Icon
The read dataset with the text content and the assigned class.

Views

This node has no views

Workflows

  • No workflows found

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.