Term Neighborhood Extractor

This node extracts the neighborhood for each term of the document. The neighborhood is defined by the N left and N right neighbors that can be set in the dialog. The neighbors can be extracted as term cells or string cells. Each neighbor is stored in a column, however it's also possible to store the right and left neighbors in a collection cell. Additionally, there is an option to extract the sentence in which the term occurs.

Options

Document column
Select the document column that is used for term neighborhood extraction.
Neighbors to extract
The number of (left/right) neighbors to extract.
Extract sentences
If checked, the second column contains the sentence of the document containing the term.
Neighbor Terms as Strings
If checked, neighbors will be extracted as String columns, otherwise as Term columns.
Neighbors as Collection
If checked, the neighbors will be all encapsulated in a collection (one collection for left neighbors, one collection for right neighbors), otherwise there will be one column for each neighbor. If no neighbor exists, the value is a missing value.

Input Ports

Icon
The input table containing the documents.

Output Ports

Icon
An output table containing documents, terms and their N neighbors.

Views

This node has no views

Workflows

Links

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.