1 ×

HTML Node to Text

StreamablePalladian Nodes for KNIME Workbench version 1.8.0.201907271536 by palladian.ws; Philipp Katz, Klemens Muthmann, David Urbansky.

Converts HTML markup to a more or less human-readable string. For example, insert line breaks for HTML block level elements such as <p>, filter comments, scripts, and stylesheets, remove unnecessary white space, and much more.

Options

Input
Column in the input table with the DOM documents to transform.

Input Ports

Input with (X)HTML documents parsed as DOM/XML.

Output Ports

Input table with appended column containing the text.

Best Friends (Incoming)

Best Friends (Outgoing)

Workflows

Installation

To use this node in KNIME, install Palladian for KNIME from the following update site:

KNIME 4.0
Wait a sec! You want to explore and install nodes even faster? We highly recommend our NodePit for KNIME extension for your KNIME Analytics Platform.

Developers

You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.