The StanfordNLP NE Learner creates a conditional random field model based on documents and a dictionary with entities that occur in the documents. The chosen tag and the used dictionary will be saved internally, so they can be used by the StanfordNLP NE tagger to tag new documents and validate the model. If you want to use the model externally, the model file can be found at your workflow directory:
/%KNIMEWORKSPACE%/%WORKFLOW%/StanfordNLP NE Learner(##)/port_1/object/portobject.zip
You can select the document column and the dictionary column to train your model with. It is possible to use multi-term entities within the dictionary. There is also a tab in the dialog to specify the learner properties. Currently, there are only a few options, since the number of parameters is pretty huge. So please contact us, if there are important/highly used parameters, we should integrate!
NOTE: If you are interested in the StanfordNLP toolkit, please visit http://nlp.stanford.edu/software/. Some of the following property descriptions are taken from the NERFeatureFactory class from StanfordNLP. Please look into it for further information.
You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.
To use this node in KNIME, install the extension KNIME Textprocessing from the below update site following our NodePit Product and Node Installation Guide:
A zipped version of the software site can be downloaded here.
Do you have feedback, questions, comments about NodePit, want to support this platform, or want your own nodes or workflows listed here as well? Do you think, the search results could be improved or something is missing? Then please get in touch! Alternatively, you can send us an email to email@example.com, follow @NodePit on Twitter, or chat on Gitter!
Please note that this is only about NodePit. We do not provide general support for KNIME — please use the KNIME forums instead.