This node loads webpages by issuing HTTP GET requests and parsing the
returned HTML. Parsing is done with the jsoup library, which implements the
WHATWG HTML5 specification. The parsed HTML is cleaned by removing comments
and, optionally, replacing relative URLs with absolute ones.
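The cleaning step can be illustrated with a minimal sketch. It is not the node's implementation: it uses Python's standard `html.parser` rather than jsoup (so it is not a WHATWG-compliant parser), and the names `Cleaner`, `clean_html`, and `URL_ATTRS`, as well as the choice of `href` and `src` as the only URL attributes, are assumptions made for illustration.

```python
from html import escape
from html.parser import HTMLParser
from urllib.parse import urljoin

# Attributes treated as URLs (an assumption; not taken from the node).
URL_ATTRS = {"href", "src"}
# Elements whose text content must be passed through unescaped.
RAW_TEXT_TAGS = {"script", "style"}


class Cleaner(HTMLParser):
    """Re-serializes HTML, dropping comments and absolutizing URLs."""

    def __init__(self, base_url):
        super().__init__(convert_charrefs=True)
        self.base_url = base_url
        self.out = []
        self._raw = False  # True while inside <script> or <style>

    def _open_tag(self, tag, attrs, closer):
        parts = [tag]
        for name, value in attrs:
            if value is None:  # attribute without a value, e.g. "disabled"
                parts.append(name)
                continue
            if name in URL_ATTRS:
                value = urljoin(self.base_url, value)
            parts.append(f'{name}="{escape(value, quote=True)}"')
        self.out.append("<" + " ".join(parts) + closer)

    def handle_starttag(self, tag, attrs):
        self._raw = tag in RAW_TEXT_TAGS
        self._open_tag(tag, attrs, ">")

    def handle_startendtag(self, tag, attrs):
        self._open_tag(tag, attrs, " />")

    def handle_endtag(self, tag):
        self._raw = False
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        self.out.append(data if self._raw else escape(data, quote=False))

    def handle_comment(self, data):
        pass  # comments are intentionally dropped


def clean_html(html, base_url):
    """Return the HTML without comments and with absolute href/src URLs."""
    cleaner = Cleaner(base_url)
    cleaner.feed(html)
    cleaner.close()
    return "".join(cleaner.out)
```

Calling `clean_html(page_source, "https://example.com/a/")` resolves every `href` and `src` value against the given base URL. Document type declarations are dropped by this sketch, which a real parser would preserve.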
By default, the output table contains a column with the parsed HTML
converted to XHTML. Alternatively, you can choose to output the parsed
HTML as a string.
The node allows you to either send a request to a fixed URL (which is
specified in the dialog) or to a list of URLs provided by an optional
input table. Every URL will result in one request which in turn will
result in one row in the output table. You can define custom request
headers in the dialog.
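The one-URL, one-request, one-row behavior can be sketched as follows. This is an illustration using Python's standard `urllib`, not the node's code; the function names `build_request` and `retrieve`, the row layout (`url`, `status`, `html`), and the injectable `opener` parameter are all assumptions.

```python
from urllib.request import Request, urlopen


def build_request(url, headers=None):
    """Create a GET request that carries the custom request headers."""
    return Request(url, headers=dict(headers or {}), method="GET")


def retrieve(urls, headers=None, opener=urlopen):
    """Send one GET request per URL and return one row (dict) per request."""
    rows = []
    for url in urls:
        with opener(build_request(url, headers)) as response:
            body = response.read().decode("utf-8", errors="replace")
            rows.append({"url": url, "status": response.status, "html": body})
    return rows
```

A fixed URL corresponds to calling `retrieve` with a single-element list, while an input table of URLs corresponds to passing the whole column. The `opener` parameter exists only so the function can be exercised without network access.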
The node supports several authentication methods, e.g. BASIC and
DIGEST. Other authentication methods may be provided by additional
extensions.
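For orientation, BASIC authentication amounts to sending the Base64-encoded `username:password` pair in an `Authorization` header. The sketch below shows that header construction only; the helper name `basic_auth_header` is hypothetical, and how the node builds the header internally is not stated in this description.

```python
import base64


def basic_auth_header(username, password):
    """Build the Authorization header used by HTTP BASIC authentication."""
    credentials = f"{username}:{password}".encode("utf-8")
    token = base64.b64encode(credentials).decode("ascii")
    return {"Authorization": f"Basic {token}"}
```

DIGEST authentication is not shown because it requires a challenge-response exchange with the server rather than a single static header.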
[…] (4xx and 5xx) will result in a missing value in the output containing the
appropriate status code as well as the error message, and the node will continue.
[…] (3xx).
[…] Accept or X-Custom-Key.
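The handling of 4xx and 5xx responses described above, where the row gets a missing value together with the status code and error message and processing continues, might look roughly like this. It is a sketch under assumptions: `fetch_row` and its row layout are hypothetical, `None` stands in for a missing value, and Python's `urllib` is used in place of the node's HTTP client.

```python
from urllib.error import HTTPError
from urllib.request import urlopen


def fetch_row(url, opener=urlopen):
    """Fetch one URL; on a 4xx/5xx response return a row with a missing body."""
    try:
        with opener(url) as response:
            body = response.read().decode("utf-8", errors="replace")
            return {"url": url, "status": response.status, "html": body, "error": None}
    except HTTPError as err:
        # urllib raises HTTPError for 4xx and 5xx responses. Instead of
        # aborting, record the status code and message and keep going.
        return {"url": url, "status": err.code, "html": None, "error": str(err.reason)}
```

Because the exception is caught per URL, a failing request affects only its own row, and the remaining URLs are still processed.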
To use this node in KNIME, install the KNIME REST Client Extension.