
03 Tagging

Exercise: Tagging documents

In this exercise you'll add different types of tags to the agenda of the L4-TP instructor-led course.

1) Execute the workflow below. It accesses the agenda text from a PDF file and converts it into a document.

2) Apply POS tagging to the document. Which POS tags are the most common in the agenda? (You can inspect the results with the Tagged Document Viewer node.)
Answer: The most common tag is NNP (the green tag).

3) Apply NE tagging to the document. Which organizations are mentioned in the document?
Answer: Microsoft (in the Microsoft Word title), KNIME AG, and KNIME GmbH.

4) Apply wildcard tagging to the document. Match with the regular expressions "\d" and "\D+\d". Apply the tag type NE and tag value UNKNOWN. Which parts of the agenda did you tag here?
Answer: This produced a number of UNKNOWN tags: it tagged "Session 2" and all the session numbers.

Workflow annotations: Read course agenda as pdf; pos tagger; view POS tags; NE tagger; view NE tags; wildcard tagger; expressions; view wildcard tags

Workflow nodes: Tika Parser, Strings To Document, POS Tagger, Tagged Document Viewer, StanfordNLP NE Tagger, Tagged Document Viewer, Wildcard Tagger, Table Creator, Tagged Document Viewer
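To get a feel for what the two expressions in step 4 can match, here is a minimal Python sketch. It is not how the Wildcard Tagger node is implemented. It assumes that an expression must match a whole candidate term (hence re.fullmatch), and the list of terms is hypothetical, not taken from the actual agenda PDF.

```python
import re

# The two expressions from step 4 of the exercise.
PATTERNS = [r"\d", r"\D+\d"]


def matching_patterns(term: str) -> list[str]:
    """Return the expressions that match the whole term (assumed semantics)."""
    return [p for p in PATTERNS if re.fullmatch(p, term)]


# Hypothetical candidate terms; the real agenda text may differ.
terms = ["2", "Session 2", "Session", "10", "KNIME"]

tagged = {term: matching_patterns(term) for term in terms}

for term, hits in tagged.items():
    # A term with at least one hit would receive tag type NE, value UNKNOWN.
    label = "NE/UNKNOWN" if hits else "untagged"
    print(f"{term!r}: {label} {hits}")
```

Under these assumptions, "\d" only covers a single-digit term, while "\D+\d" covers one or more non-digit characters followed by exactly one digit, which is consistent with a term such as "Session 2" being tagged.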
