Icon

06_​NY_​Times_​RSS_​Feed_​Tag_​Cloud

Extraction and Tag Cloud Visualization of Named Entities from New York Times News Feeds

The workflow starts with a URL to a NY Times rss news feed. The news feed is downloaded and parsed and transformed in DocumentCells.

Names of persons, organizations and locations are then recognized and the corresponding tags are assigned, in order to apply a coloring based on a tag type later on.

After transformation into a bag of words, and filtering of all non-persons, -organizations, or –locations colors are assigned and the terms are visualized via a Tag Cloud.

This workflow parses a NY Times RSS news Feed, tahs persons, organization, and locations and visualizes them in a tagcloud. Downloading of latest NY Times rss feed and transformation into document cells Named Entity Recognition Tag Cloud Visualization Preparation Color by tag (person, organization, location)NY Times rss feedTag personsTag organizationsTag locations Extract each post into a rowExtract title,description,...Tags To String TF Color Manager Tag Cloud GroupBy Table Creator Column Filter(deprecated) Tag Filter Strings To Document OpenNLP NE Tagger OpenNLP NE Tagger OpenNLP NE Tagger Bag Of WordsCreator Tag Cloud Webpage Retriever XPath XPath Transform Data&Time This workflow parses a NY Times RSS news Feed, tahs persons, organization, and locations and visualizes them in a tagcloud. Downloading of latest NY Times rss feed and transformation into document cells Named Entity Recognition Tag Cloud Visualization Preparation Color by tag (person, organization, location)NY Times rss feedTag personsTag organizationsTag locations Extract each post into a rowExtract title,description,...Tags To String TF Color Manager Tag Cloud GroupBy Table Creator Column Filter(deprecated) Tag Filter Strings To Document OpenNLP NE Tagger OpenNLP NE Tagger OpenNLP NE Tagger Bag Of WordsCreator Tag Cloud Webpage Retriever XPath XPath Transform Data&Time

Nodes

Extensions

Links