This node applies the provided embedding model to create embeddings of the texts contained in a string column of the input table.
A text embedding is a dense vector of floating point values capturing the semantic meaning of the text by mapping it to a high-dimensional space. Similarities between embedded entities are then derived by how close they are to each other in said space. These embeddings are often used to find semantically similar documents e.g. in vector stores.
Different embedding models encode text differently, resulting in incomparable embeddings. If this node fails to execute with 'Execute failed: Error while sending a command.', refer to the description of the node that provided the embedding model.
Note: If you use the Credentials Configuration node and do not select the "Save password in configuration (weakly encrypted)" option for passing the API key for the Embedding Model Selector node, the Credentials Configuration node will need to be reconfigured upon reopening the workflow, as the credentials flow variable was not saved and will therefore not be available to downstream nodes.
The string column containing the texts to embed.
Name for output column that will hold the embeddings.
Define whether missing or empty values in the text column should result in missing values in the output table or whether the node execution should fail on such values.
Available options:
You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.
To use this node in KNIME, install the extension KNIME Python Extension Development (Labs) from the below update site following our NodePit Product and Node Installation Guide:
A zipped version of the software site can be downloaded here.
Deploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!