Icon

Custom data

This directory contains 8 workflows.

Icon01 Document Viewer 

01 Document Viewer L4-TP SELF-PACED COURSE exercise. Explore documents with the Document Viewer node. VIDEO: Additional Data Types for […]

Icon02 Reading Text Data 

02 Reading Text Data L4-TP SELF-PACED COURSE exercise. Encapsulate text and its metainformation from strings to a document. Access data from a pdf file […]

Icon03 Tagging 

03 Tagging L4-TP SELF-PACED COURSE exercise. Apply parts-of-speech, named entity, and wildcard tagging. VIDEO: Document Tagging: […]

Icon04 Cleaning and Filtering 

04 Cleaning and Filtering L4-TP SELF-PACED COURSE exercise. Remove stop words and punctuation, convert all words to lower case, and include only words […]

Icon05 Bag of Words and Frequencies 

05 Bag of Words and Frequencies L4-TP SELF-PACED COURSE exercise. Create a bag of words of a document. Calculate document frequencies (DF), term […]

Icon06 Document Vector 

06 Document Vector L4-TP SELF-PACED COURSE exercise. Transform a document into a document vector, and apply the document vector model to a new […]

Icon07 Visualizations 

07 Visualizations L4-TP SELF-PACED COURSE exercise. Visualize a document in a word cloud. Show the frequencies of words in a bar […]

Icon08 Topic Detection 

08 Topic Detection L4-TP SELF-PACED COURSE exercise. Assign documents to topics using the LDA algorithm. Infer the contents of each topic based on their […]