01.2_Web-retrieving_Nodes_Solution

Solution: Retrieving text from the Web

This workflow shows how to extract text from a web page based on its URL.


Session 1
Exercise 1.2 - Web-retrieving nodes

Summary:
In this exercise we are going to extract text from a web page and implement error handling in case of failure.

Instructions:
1) Execute the Access URLs metanode to load a sample of KNIME blog URLs.
2) Access the blog posts' texts with the Web Text Scraper component.
3) Put the component inside a try-catch block. Provide an alternative input from the provided local table.
4) Build a chunk loop around the try-catch block to test each URL separately.

Workflow annotations:
- URLs
- Extracting valid data
- Loop start
- Hint: Use a chunk loop
- Loop end
- Try
- Catch
- An alternative input if the GET request failed.
- Requesting the data using the Web Text Scraper component
- Filter the rows without any error message

Nodes in the workflow:
- Access URLs
- Chunk Loop Start
- Try (Data Ports)
- Web Text Scraper
- Table Creator
- Catch Errors (Data Ports)
- Loop End
- Row Filter
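For readers who think in code, the pattern in this exercise (one URL per loop iteration, a try-catch around the request, an alternative row on failure, then a filter on the error message) can be sketched in Python. This is only an analogy and not how the KNIME nodes are implemented: `fetch_text`, `html_to_text`, `scrape_each`, and `keep_valid` are hypothetical names, and the row layout with `url`, `text`, and `error` fields is an assumption made for illustration. The filter is read here as keeping the rows that carry no error message.

```python
from html.parser import HTMLParser
from urllib.request import urlopen


class _TextExtractor(HTMLParser):
    """Collects visible text, skipping <script> and <style> content."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0 and data.strip():
            self.parts.append(data.strip())


def html_to_text(html):
    """Reduce an HTML string to its whitespace-joined text content."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.parts)


def fetch_text(url, timeout=10):
    """Hypothetical stand-in for the Web Text Scraper component."""
    with urlopen(url, timeout=timeout) as response:
        html = response.read().decode("utf-8", errors="replace")
    return html_to_text(html)


def scrape_each(urls, fetch=fetch_text):
    """Process one URL per iteration so one failure cannot stop the rest."""
    rows = []
    for url in urls:  # plays the role of the chunk loop
        try:  # plays the role of Try (Data Ports)
            rows.append({"url": url, "text": fetch(url), "error": None})
        except Exception as exc:  # plays the role of Catch Errors (Data Ports)
            # Alternative row, analogous to the Table Creator input.
            rows.append({"url": url, "text": None, "error": str(exc)})
    return rows


def keep_valid(rows):
    """Keep only the rows without an error message (the Row Filter step)."""
    return [row for row in rows if row["error"] is None]
```

Calling `keep_valid(scrape_each(urls))` on a list of URLs returns only the rows whose request succeeded. Passing the fetcher as a parameter is a design choice of this sketch: it allows the error-handling logic to be exercised with a fake fetcher, without network access.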
