This node can be used to retrieve webpages by issuing HTTP GET requests
and parsing the
requested HTML webpages. For parsing,
jsoup
is used as library which implements the
WHATWG HTML5
specification. The parsed HTML will be cleaned by removing comments
and, optionally, replacing relative URLs by absolute ones.
By default, the output table will contain a column with the parsed
HTML converted into XHTML. However, you can specify to get the parsed
HTML as string output instead.
The node allows you to either send a request to a fixed URL (which is
specified in the dialog) or to a list of URLs provided by an optional
input table. Every URL will result in one request which in turn will
result in one row in the output table. You can define custom request
headers in the dialog.
The node supports several authentication methods, e.g. BASIC and
DIGEST. Other authentication methods may be provided by additional
extensions.
Cookies can be send to the server via the Request Header tab by setting the "Cookie" header.
In order to receive cookies, set the "Extract cookies" option. Any cookies sent by the server
are then extracted and appended as a List Cell in the output.
3xx
).Accept
or X-Custom-Key
. Note that some
header keys such as Origin
are silently ignored by default for security reasons. You can configure
KNIME AP to allow any header key by setting the sun.net.http.allowRestrictedHeaders
system property in the
knime.ini configuration file to true
.You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.
To use this node in KNIME, install the extension KNIME REST Client Extension from the below update site following our NodePit Product and Node Installation Guide:
A zipped version of the software site can be downloaded here.
Do you have feedback, questions, comments about NodePit, want to support this platform, or want your own nodes or workflows listed here as well? Do you think, the search results could be improved or something is missing? Then please get in touch! Alternatively, you can send us an email to mail@nodepit.com, follow @NodePit on Twitter, or chat on Gitter!
Please note that this is only about NodePit. We do not provide general support for KNIME — please use the KNIME forums instead.