03_AL_Training_Subset_Uncertain_Classes

Emil the TeacherBot - Creating a Subset of the Training Set Based on the Most Uncertain Predicted Class

This workflow is one of a series of workflows that address a data mining scenario at the intersection of active learning, text mining, stream mining and service-oriented knowledge discovery architectures.

This workflow, in particular, creates a subset of the training set based on the most uncertain predicted classes. It first reads the entire training set. Then it processes the questions and predicts the class of each one. The loop body computes the differences between the three top probabilities for each predicted class of each question. Finally, a subset of the entire training set is created based on the most uncertain predicted class and saved as a new table.

Annotations:
Reading Data
Process the Questions: at this stage the workflow processes the questions and predicts the class of each one.
Loop Body: computing the differences between the three top probabilities for each predicted class
Create a subset of the training set based on the most uncertain predicted classes
Preparing the data to be saved

Nodes:
Column Filter
Transpose
Parallel Chunk Start
Parallel Chunk End
Column List Loop Start
Loop End
Table Writer
Get_Class_Probabilities
Uncertain Predicted Class (computing diff between three top probabilities)
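The margin computation described above can be illustrated outside KNIME with a short Python sketch. It is not the workflow's actual implementation: the function names, the dictionary input format, and the choice to rank classes by their mean top-1/top-2 margin are all assumptions made for illustration. The workflow description only states that differences between the three top probabilities are computed and that the subset is built from the most uncertain predicted class.

```python
from collections import defaultdict


def top3_margins(probs):
    """Return (predicted_class, p1 - p2, p2 - p3) for one question.

    `probs` maps class name -> predicted probability (hypothetical format).
    """
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)[:3]
    if len(ranked) < 3:
        raise ValueError("need at least three classes")
    (c1, p1), (_, p2), (_, p3) = ranked
    return c1, p1 - p2, p2 - p3


def most_uncertain_classes(predictions, n_classes=1):
    """Rank predicted classes by mean top-1/top-2 margin.

    Assumption: a smaller mean margin means a more uncertain class.
    """
    margins = defaultdict(list)
    for probs in predictions:
        cls, d12, _ = top3_margins(probs)
        margins[cls].append(d12)
    mean_margin = {c: sum(v) / len(v) for c, v in margins.items()}
    return sorted(mean_margin, key=mean_margin.get)[:n_classes]


def uncertain_subset(rows, predictions, n_classes=1):
    """Keep the training rows whose predicted class is among the most uncertain."""
    chosen = set(most_uncertain_classes(predictions, n_classes))
    return [
        row
        for row, probs in zip(rows, predictions)
        if top3_margins(probs)[0] in chosen
    ]


# Toy data: four questions with class probabilities for classes a, b, c.
rows = ["q0", "q1", "q2", "q3"]
predictions = [
    {"a": 0.70, "b": 0.20, "c": 0.10},
    {"a": 0.40, "b": 0.35, "c": 0.25},
    {"a": 0.30, "b": 0.36, "c": 0.34},
    {"a": 0.05, "b": 0.15, "c": 0.80},
]
subset = uncertain_subset(rows, predictions, n_classes=1)
```

In this toy data, class b has the smallest mean margin, so `subset` contains only the question predicted as b. A different uncertainty score, for example one that also uses the second margin `p2 - p3`, could be swapped in without changing the overall structure.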
