DBPedia texts classification with BERT by Redfield

The workflow uses an enhanced version of the DBpedia classes dataset that also includes Portuguese texts. The English data is taken from:
https://www.kaggle.com/danofer/dbpedia-classes
The workflow shows how to use the BERT extension for KNIME by Redfield to train models for text classification.

Required Python packages (need to be available in your TensorFlow 2 Python environment):
bert==2.2.0
bert-for-tf2==0.14.4
Keras-Preprocessing==1.1.2
numpy==1.19.1
pandas==0.23.4
pyarrow==0.11.1
tensorboard==2.2.2
tensorboard-plugin-wit==1.7.0
tensorflow==2.2.0
tensorflow-estimator==2.2.0
tensorflow-hub==0.8.0
tokenizers==0.7.0
tqdm==4.48.0
transformers==3.0.2
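
Before running the workflow, it can help to confirm that the Python environment really matches the pins above. The following is a minimal sketch, not part of the workflow or the Redfield extension: the helper names (`parse_pins`, `installed_version`, `find_mismatches`) are hypothetical, and it assumes Python 3.8 or newer, where `importlib.metadata` is in the standard library. Run it with the interpreter of the TensorFlow 2 environment that KNIME is configured to use.

```python
from importlib import metadata  # standard library in Python 3.8+ (assumption)

# Pins copied from the list of required packages above.
PINNED = """
bert==2.2.0
bert-for-tf2==0.14.4
Keras-Preprocessing==1.1.2
numpy==1.19.1
pandas==0.23.4
pyarrow==0.11.1
tensorboard==2.2.2
tensorboard-plugin-wit==1.7.0
tensorflow==2.2.0
tensorflow-estimator==2.2.0
tensorflow-hub==0.8.0
tokenizers==0.7.0
tqdm==4.48.0
transformers==3.0.2
"""


def parse_pins(text):
    """Turn 'name==version' lines into a {name: version} dict."""
    pins = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        name, _, version = line.partition("==")
        pins[name] = version
    return pins


def installed_version(name):
    """Return the installed version of a distribution, or None if absent."""
    try:
        return metadata.version(name)
    except metadata.PackageNotFoundError:
        return None


def find_mismatches(pins, lookup=installed_version):
    """Return {name: (expected, found)} for every package that differs."""
    mismatches = {}
    for name, expected in pins.items():
        found = lookup(name)
        if found != expected:
            mismatches[name] = (expected, found)
    return mismatches


if __name__ == "__main__":
    # Print one line per package whose installed version differs from the pin.
    for name, (expected, found) in find_mismatches(parse_pins(PINNED)).items():
        print(f"{name}: expected {expected}, found {found or 'not installed'}")
```

If the script prints nothing, every pinned package is installed at the listed version. The `lookup` parameter exists only so the comparison can be exercised without touching the real environment.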

Workflow diagram (nodes and annotations): Table Reader, Text assessment, Count class representatives, Filter rare classes and partition, BERT Model Selector, BERT configuration, BERT Classification Learner, BERT Predictor, Misclassification analysis, Timer Info, Domain Calculator, Update table spec.
