03 Tagging

Exercise: Tagging documents

In this exercise you'll add different types of tags to the agenda of the L4-TP instructor-led course.

1) Execute the workflow below. It accesses the agenda text from a PDF file and converts it into a document.
2) Apply POS tagging to the document. Which POS tags are the most common in the agenda? (You can inspect the results with the Tagged Document Viewer node.) - proper nouns (NNP)
3) Apply NE tagging to the document. Which organizations are mentioned in the document? - Microsoft, KNIME AG, and KNIME GmbH
4) Apply wildcard tagging to the document. Match with the regular expressions "\d" and "\D+\d". Apply the tag type NE and tag value UNKNOWN. Which parts of the agenda did you tag here? - the session identifiers

Workflow annotations: Read course agenda as pdf, Parts of speech, Organizations, Session identifiers, Regular expressions

Workflow nodes: Tika Parser, Strings To Document, POS Tagger, StanfordNLP NE Tagger, Wildcard Tagger, Table Creator, Tagged Document Viewer (three instances)
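The regular-expression matching in the wildcard tagging step can be tried out outside the workflow. The following is a minimal Python sketch, not the Wildcard Tagger's implementation: it assumes each pattern must match a whole term, and the function name `tag_terms` and the sample terms are hypothetical, chosen only for illustration (the actual agenda text is not reproduced here).

```python
import re

# The two expressions from the exercise.
PATTERNS = [re.compile(r"\d"), re.compile(r"\D+\d")]


def tag_terms(terms, tag_type="NE", tag_value="UNKNOWN"):
    """Attach (tag_type, tag_value) to every term fully matched by a pattern.

    Assumption: a pattern has to match the whole term, not just a part of it.
    Terms without a match get (None, None).
    """
    tagged = []
    for term in terms:
        if any(p.fullmatch(term) for p in PATTERNS):
            tagged.append((term, tag_type, tag_value))
        else:
            tagged.append((term, None, None))
    return tagged


# Hypothetical sample terms, not taken from the course agenda.
terms = ["Session", "1", "S2", "Introduction", "2024"]
for term, tag_type, tag_value in tag_terms(terms):
    print(term, tag_type, tag_value)
```

Under the whole-term assumption, "\d" covers a single digit and "\D+\d" covers one or more non-digit characters followed by a single digit, so a multi-digit number such as a year is left untagged.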
