Icon

00_​Guided_​Labeling_​for_​Document_​Classification_​(legacy)

Guided Labeling for Document Classification (Legacy)

Updated workflow with 4.1 nodes available. Check references below.

This old legacy workflow defines a fully automated web based application that will label your data using active learning. The workflow was designed for business analysts to easily go through documents to be labeled in any number of classes. In each iteration the user labels more documents and the model is trained using the already labeled instances. With every new iteration, the model proposes the most uncertain documents using the entropy scorer node. Once the user is happy with the performance achieved with the available labels, they can exit the loop and export the model to label the remaining instances.

This workflow is made to be deployed on KNIME WebPortal via KNIME Server.




Guided Labeling for Document Classification (Legacy)This workflow defines a fully automated web based application that will label your data using active learning and uncertainty sampling.This workflow is made to be deployed on KNIME WebPortal via KNIME Server. To test the Guided Analytics application on KNIME Analytics Platform:- Right click the Label Component and "Execute and Open Views"- Follow the in-view instructions- After saving your interactions, right click the Active Learning Loop End and "Step Loop Execution"- Open the Label Component view again to see the second iteration of the human-in-the-loop The Process Step by Step1. Upload your documents and enter / upload the labels you want to use2. Start labeling your data3. Monitor model accuracy as you provide more labels4. When the accuracy reaches a desired amount, exit the loop5. Download the model and the labels, and visualize the results Show user currentpredictions and ask formore labels. Allow user to download themodel trained on all the labelddata. top output : data to train modelbottom output:past iterations statsCreate empty table to track accuracies.top: new labelsbottom : new statsRemove current iter. statsRemove current iter. probsport 0: already labeledport 1: still to be labeledport 2: iterations statsRecursive LoopStart (2 ports) Table Creator Recursive LoopEnd (2 ports) Label Concatenate Text Preprocessing Upload Deploy Pre-process forVisualization Joiner Column Filter Column Splitter Initialize / Train Classifierwith Available Labels Guided Labeling for Document Classification (Legacy)This workflow defines a fully automated web based application that will label your data using active learning and uncertainty sampling.This workflow is made to be deployed on KNIME WebPortal via KNIME Server. To test the Guided Analytics application on KNIME Analytics Platform:- Right click the Label Component and "Execute and Open Views"- Follow the in-view instructions- After saving your interactions, right click the Active Learning Loop End and "Step Loop Execution"- Open the Label Component view again to see the second iteration of the human-in-the-loop The Process Step by Step1. Upload your documents and enter / upload the labels you want to use2. Start labeling your data3. Monitor model accuracy as you provide more labels4. When the accuracy reaches a desired amount, exit the loop5. Download the model and the labels, and visualize the results Show user currentpredictions and ask formore labels. Allow user to download themodel trained on all the labelddata. top output : data to train modelbottom output:past iterations statsCreate empty table to track accuracies.top: new labelsbottom : new statsRemove current iter. statsRemove current iter. probsport 0: already labeledport 1: still to be labeledport 2: iterations statsRecursive LoopStart (2 ports) Table Creator Recursive LoopEnd (2 ports) Label Concatenate Text Preprocessing Upload Deploy Pre-process forVisualization Joiner Column Filter Column Splitter Initialize / Train Classifierwith Available Labels

Nodes

Extensions

Links