Icon

01.2_​Web-retrieving_​Nodes_​Solution

Solution: Retrieving text from the Web

This workflow shows how to extract text from a web page based on its URL.

URLs Session 1Exercise 1.2 - Web-retrieving nodesSummary:In this exercise we are going to extract text from a web page and implement an error handling in caseof failureInstructions:1) Execute the Access URLs metanode to load a sample of KNIME blog URLs2) Access the blog posts texts with the Web Text Scraper component3) Put the component inside a try catch loop. Provide an alternative input from the provided local table.4) Build a chunk loop around the try catch loop to test each URL separately Extracting valid data Loop start Hint: Use a chunk loop Loop end Try Catch An alternative input if the GET request failed. Requesting for the data using the Web TextScraper component Filter the rows without any error messageWeb Text Scraper Try (Data Ports) Chunk Loop Start Loop End Catch Errors(Data Ports) Table Creator Row Filter Access URLs URLs Session 1Exercise 1.2 - Web-retrieving nodesSummary:In this exercise we are going to extract text from a web page and implement an error handling in caseof failureInstructions:1) Execute the Access URLs metanode to load a sample of KNIME blog URLs2) Access the blog posts texts with the Web Text Scraper component3) Put the component inside a try catch loop. Provide an alternative input from the provided local table.4) Build a chunk loop around the try catch loop to test each URL separately Extracting valid data Loop start Hint: Use a chunk loop Loop end Try Catch An alternative input if the GET request failed. Requesting for the data using the Web TextScraper component Filter the rows without any error messageWeb Text Scraper Try (Data Ports) Chunk Loop Start Loop End Catch Errors(Data Ports) Table Creator Row Filter Access URLs

Nodes

Extensions

Links