The Cache node materializes and caches its input table. It is useful after a sequence of preprocessing steps, especially steps that transform columns by removing, manipulating, or adding them.
In workflows with several transformation nodes, each node stores only the data it modifies (e.g., added columns), while unmodified columns reference the input data. This approach optimizes execution and data caching for individual nodes, but it can produce tables that are composites of multiple nested tables. Iterating over such a composite table may be less efficient than iterating over a single, unified table.
The Cache node addresses this by materializing the input data, creating a self-contained table that consolidates all columns. Additionally, the Cache node is useful in scenarios where portions of a workflow are executed in streaming mode, as it allows data to be staged at specific points. This staging facilitates inspection and debugging, providing a snapshot of the data at the desired point in the workflow.
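The difference between a composite table and a materialized one can be illustrated with a small toy model. This is only a conceptual sketch, not KNIME's actual table implementation: the `LayeredTable` class and the `materialize` function are hypothetical names introduced for illustration.

```python
from typing import Dict, List, Optional


class LayeredTable:
    """Toy table that stores only its own columns and delegates the rest to a parent.

    Hypothetical model for illustration; it does not mirror KNIME internals.
    """

    def __init__(self, own: Dict[str, List[int]], parent: Optional["LayeredTable"] = None):
        self.own = own
        self.parent = parent

    def column_names(self) -> List[str]:
        # Inherited columns come first, then the columns stored in this layer.
        inherited = [] if self.parent is None else [
            name for name in self.parent.column_names() if name not in self.own
        ]
        return inherited + list(self.own)

    def column(self, name: str) -> List[int]:
        # Columns not stored here are looked up through the chain of parents.
        if name in self.own:
            return self.own[name]
        if self.parent is None:
            raise KeyError(name)
        return self.parent.column(name)

    def depth(self) -> int:
        # Number of nested layers a lookup may have to traverse.
        return 1 if self.parent is None else 1 + self.parent.depth()


def materialize(table: LayeredTable) -> LayeredTable:
    """Copy every visible column into a single self-contained table."""
    return LayeredTable({name: list(table.column(name)) for name in table.column_names()})


# Two transformation steps, each adding one column on top of its input.
base = LayeredTable({"id": [1, 2, 3], "price": [10, 20, 30]})
with_tax = LayeredTable({"tax": [p // 10 for p in base.column("price")]}, parent=base)
with_total = LayeredTable(
    {"total": [p + t for p, t in zip(with_tax.column("price"), with_tax.column("tax"))]},
    parent=with_tax,
)

# Conceptually, a cache step flattens the nested layers into one table.
cached = materialize(with_total)
```

In this model, `with_total` spans three nested layers, so reading `id` walks through two parents, whereas `cached` holds all four columns in a single layer.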
Specify whether the output takes the domains of all input columns as-is or computes them from the output rows.
Depending on the use case, one or the other setting may be preferable:
If column domains are irrelevant for a particular use case, the "Retain" option should be used since it does not incur computation costs.
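The trade-off between the two settings can be sketched as follows. This is an illustrative sketch under stated assumptions, not the node's implementation: a domain is modeled as a simple (lower, upper) bound of a non-empty numeric column, and the function names and the `"retain"` / `"compute"` mode labels are hypothetical.

```python
from typing import Dict, List, Tuple

Domain = Tuple[int, int]  # assumed model: (lower bound, upper bound) of a numeric column


def compute_domains(columns: Dict[str, List[int]]) -> Dict[str, Domain]:
    """Scan every value of every (non-empty) column; cost grows with the number of rows."""
    return {name: (min(values), max(values)) for name, values in columns.items()}


def output_domains(
    input_domains: Dict[str, Domain],
    output_columns: Dict[str, List[int]],
    mode: str,
) -> Dict[str, Domain]:
    """Return the output domains for the chosen (hypothetical) mode."""
    if mode == "retain":
        # No data is read: the input domains are passed through unchanged.
        return dict(input_domains)
    if mode == "compute":
        # The output rows are scanned to derive domains that match the actual data.
        return compute_domains(output_columns)
    raise ValueError(f"unknown mode: {mode}")


# The input domain is wider than the values that actually reach the output.
input_domains = {"age": (0, 100)}
output_columns = {"age": [23, 41, 35]}

retained = output_domains(input_domains, output_columns, "retain")
computed = output_domains(input_domains, output_columns, "compute")
```

Here `retained` keeps the wider input bounds without touching the data, while `computed` reflects only the values present in the output at the cost of a full pass over the rows.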
To use this node in KNIME, install the KNIME Base nodes extension.