1 ×

Scanning_​web_​page_​for_​URLs_​17605

Workflow

https://forum.knime.com/t/scanning-web-page-for-urls/17605Using the following XPath expression to extract all links to MS Excel files://dns:a[dns:img[contains(@class,"xls_icon_mini")]]/@hrefExplanation:Get <a> tags which contain an <img> tag which has a class attribute whichcontains "xls_icon_mini" Table Creator HtmlParser XPath HttpRetriever Column Filter https://forum.knime.com/t/scanning-web-page-for-urls/17605Using the following XPath expression to extract all links to MS Excel files://dns:a[dns:img[contains(@class,"xls_icon_mini")]]/@hrefExplanation:Get <a> tags which contain an <img> tag which has a class attribute whichcontains "xls_icon_mini" Table Creator HtmlParser XPath HttpRetriever Column Filter

Download

Get this workflow from the following link: Download

Nodes

Scanning_​web_​page_​for_​URLs_​17605 consists of the following 5 nodes(s):

Plugins

Scanning_​web_​page_​for_​URLs_​17605 contains nodes provided by the following 3 plugin(s):