JKISeason3-9_tomljh

Web Scraping and Analyzing Economic News

Level: Medium

Description: Recently you became more interested in finance, and since you want to learn more about web scraping for work, you decided to unite both interests. Using the KNIME Web Interaction extension, can you navigate to the Economic News section on Yahoo Finance, extract the headers of only the most recent topics that pop up on the webpage, and then make sense of the results visually? Remember to filter out any ads or unrelated banners/headers/content. Hint: Find class tags in the news' XML that are unique to the content you are scraping.
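The hint about class attributes can be illustrated outside KNIME with a minimal Python sketch. It assumes a made-up, well-formed page fragment; the class names `story-item` and `ad-banner` are hypothetical and are not taken from Yahoo Finance. The standard-library XPath subset used here only matches the exact attribute value, whereas a full XPath engine would typically use something like `contains(@class, '...')` for elements that carry several classes.

```python
import xml.etree.ElementTree as ET

# Hypothetical page fragment: two news items, one ad, one unrelated header.
SAMPLE_PAGE = """
<div>
  <ul>
    <li class="story-item"><h3>Fed holds rates steady</h3></li>
    <li class="ad-banner"><h3>Sponsored: open an account</h3></li>
    <li class="story-item"><h3>Jobs report beats forecasts</h3></li>
  </ul>
  <h3>Trending tickers</h3>
</div>
"""


def extract_headlines(xml_text, item_class="story-item"):
    """Return the h3 text of every list item whose class equals item_class."""
    root = ET.fromstring(xml_text.strip())
    # Select only headers nested in items with the chosen class,
    # which skips the ad and the stand-alone "Trending tickers" header.
    nodes = root.findall(f".//li[@class='{item_class}']/h3")
    return [(node.text or "").strip() for node in nodes]


headlines = extract_headlines(SAMPLE_PAGE)
print(headlines)
```

Anchoring the query on a class that only the news items carry is what keeps ads and banners out of the result, even though they use the same header tag.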

Author: Thor Landstrom

Note:
1. No cookie prompt appeared on my computer, so the workflow does not handle one.
2. The economic news page shows 10 news items by default, but it uses lazy loading: scrolling to the end of the page loads a total of 100 news items. Since this is just an example, no further processing was carried out for that.

Workflow annotations:
Get the news links from https://finance.yahoo.com/topic/economic-news/
Visit the detail page of each news item. The page provides: title, datetime, text
Close the browser

Workflow nodes, in order: Web Interaction Start (Labs), Content Retriever (Labs), XPath, Navigator (Labs), Navigator (Labs), Table Row to Variable Loop Start, Content Retriever (Labs), XPath, String Cleaner, Loop End, Web Interaction End (Labs), Tile View (JavaScript)
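The loop part of the workflow (visit each news link, read title, datetime and text, then clean the strings) can be sketched in Python as follows. This is only an illustration under stated assumptions: `FAKE_PAGES` is a hypothetical stand-in for the browser, the links and the `article`/`h1`/`time`/`p` layout are invented, and `clean` is a rough analogue of what a string-cleaning step might do, not the behavior of the KNIME String Cleaner node.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical stand-in for the browser: link -> XML of the article page.
FAKE_PAGES = {
    "/news/a": "<article><h1> Fed holds\n rates </h1>"
               "<time datetime='2024-05-01T14:00:00Z'/>"
               "<p>First  paragraph.</p><p>Second.</p></article>",
    "/news/b": "<article><h1>Jobs report</h1><p>Only text.</p></article>",
}


def clean(text):
    """Collapse runs of whitespace and trim both ends."""
    return re.sub(r"\s+", " ", text).strip()


def scrape_article(link, fetch=FAKE_PAGES.__getitem__):
    """Extract title, datetime and body text from one article page."""
    root = ET.fromstring(fetch(link))
    time_node = root.find(".//time")
    return {
        "link": link,
        "title": clean("".join(root.find(".//h1").itertext())),
        # A missing time element yields None instead of raising.
        "datetime": time_node.get("datetime") if time_node is not None else None,
        "text": clean(" ".join("".join(p.itertext()) for p in root.iter("p"))),
    }


# One row per link, mirroring one loop iteration per table row.
rows = [scrape_article(link) for link in FAKE_PAGES]
for row in rows:
    print(row)
```

Collecting one dictionary per link corresponds to the Loop End step gathering one row per iteration before the results are visualized.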
