Document Similarity Learner

The Document Similarity Learner builds a model for identifying a new document's most similar matches in an existing corpus of documents. It takes already preprocessed documents (see the Document Preprocessing component) as input and outputs both the corpus of documents and a model for use with the Document Similarity Predictor component.

Options

Select preprocessed document column: Select the column containing the preprocessed documents.
Select the term column: Select the column containing the terms.
Number of keywords to extract: Number of keywords to extract from each input document.
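
This page does not say how the component selects keywords. As a rough illustration only, the sketch below picks the most frequent terms from an already preprocessed (tokenized) document. The function name `top_keywords` and the frequency-based ranking are assumptions made for this example, not the component's actual method.

```python
from collections import Counter


def top_keywords(terms, n_keywords):
    """Return the n_keywords most frequent terms (hypothetical ranking rule)."""
    counts = Counter(terms)
    # Sort by descending count, then alphabetically so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [term for term, _ in ranked[:n_keywords]]


# Example: terms of one preprocessed document (made-up data).
doc_terms = ["data", "mining", "data", "workflow", "node", "data", "node"]
keywords = top_keywords(doc_terms, 2)
print(keywords)
```

Here `n_keywords` plays the role of the "Number of keywords to extract" option: a larger value keeps more terms per document, at the cost of including rarer, possibly less characteristic ones.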

Input Ports

Documents which have already been preprocessed (via Document Preprocessing).

Output Ports

The reference corpus of documents for future comparison with new documents.
Model for creating document vectors for new documents in a compatible format.
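
To illustrate what a model for creating document vectors can look like, here is a minimal sketch, assuming a bag-of-words vocabulary learned from the corpus and cosine similarity for matching. Both choices, and the names `learn_vocabulary`, `vectorize`, and `cosine`, are assumptions made for illustration; this page does not describe the component's internals.

```python
import math


def learn_vocabulary(corpus):
    """Map each distinct corpus term to a vector index (the hypothetical 'model')."""
    terms = sorted({term for doc in corpus for term in doc})
    return {term: index for index, term in enumerate(terms)}


def vectorize(doc, vocab):
    """Turn a tokenized document into a term-count vector; unknown terms are ignored."""
    vec = [0.0] * len(vocab)
    for term in doc:
        if term in vocab:
            vec[vocab[term]] += 1.0
    return vec


def cosine(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


# Made-up reference corpus of already tokenized documents.
corpus = [
    ["data", "mining", "workflow"],
    ["text", "document", "similarity"],
    ["image", "classification"],
]
vocab = learn_vocabulary(corpus)
corpus_vectors = [vectorize(doc, vocab) for doc in corpus]

# A new document is vectorized with the same vocabulary, so the vectors are comparable.
new_vec = vectorize(["document", "similarity", "search"], vocab)
scores = [cosine(new_vec, vec) for vec in corpus_vectors]
best = max(range(len(corpus)), key=scores.__getitem__)
print(best, scores)
```

The point of shipping the vocabulary alongside the corpus is that new documents must be mapped into the same vector space before any comparison is meaningful, which matches the two output ports described above.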

Nodes

Column Selection Configuration (2 ×)
Breakpoint (1 ×)
Column Rename (1 ×)
Component Input (1 ×)
Component Output (1 ×)
(5 of 11 nodes listed)

Extensions

Feature: KNIME Base nodes
Feature: KNIME Quick Forms
Feature: KNIME Textprocessing

Download

To use this component in KNIME, download it from the URL below and open it in KNIME:

Download Component

On NodePit since: 2020-04-20

Last update: 2024-04-18

Created with KNIME version: v4.2.0

Legal info: The KNIME® trademark is registered in the United States and Germany by the KNIME GmbH. NodePit is not affiliated with KNIME®.