
03 Tagging

Exercise: Tagging documents

In this exercise you'll add different types of tags to the agenda of the L4-TP instructor-led course.

1) Execute the workflow below. It accesses the agenda text from a PDF file and converts it into a document.
2) Apply POS tagging to the document. Which POS tags are the most common in the agenda? (You can inspect the results with the Tagged Document Viewer node.)
   Answer: NNP tags are the most common.
3) Apply NE tagging to the document. Which organizations are mentioned in the document?
   Answer: Microsoft, KNIME AG, KNIME GmbH.
4) Apply wildcard tagging to the document. Match with the regular expressions "\d" and "\D+\d". Apply the tag type NE and tag value UNKNOWN. Which parts of the agenda did you tag here?
   Answer: The section numbers.

Workflow annotation: Read course agenda as pdf

Nodes in the workflow: Tika Parser, Strings To Document, POS Tagger, Tagged Document Viewer, StanfordNLP NE Tagger, Tagged Document Viewer, Wildcard Tagger, Table Creator, Tagged Document Viewer
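To see why the two regular expressions in step 4 pick out section numbers, the matching can be sketched outside KNIME in a few lines of Python. This is only an illustration, not the Wildcard Tagger's implementation: it assumes each expression must match a whole term, and the sample terms and the helper name tag_terms are hypothetical rather than taken from the agenda.

```python
import re

# The two expressions from step 4 of the exercise.
PATTERNS = [re.compile(r"\d"), re.compile(r"\D+\d")]


def tag_terms(terms, tag_type="NE", tag_value="UNKNOWN"):
    """Return (term, tag_type, tag_value) for every term that fully
    matches at least one pattern. Whole-term matching is an assumption
    made for this sketch."""
    tagged = []
    for term in terms:
        if any(p.fullmatch(term) for p in PATTERNS):
            tagged.append((term, tag_type, tag_value))
    return tagged


# Hypothetical terms, chosen to show what does and does not match.
terms = ["1", "Introduction", "Session2", "2.1", "12", "KNIME"]
result = tag_terms(terms)

for term, tag_type, tag_value in result:
    print(f"{term}: {tag_type}/{tag_value}")
```

Under the whole-term assumption, "\d" matches only a term that is a single digit, and "\D+\d" matches a run of non-digit characters followed by exactly one digit. Terms such as "12" or "2.1" match neither expression, because both start with a digit and are longer than one character.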
