News relevance webinar

This workflow was presented at the webinar "KNIME & Redfield Present State-of-the-Art Language Models, Now Available Through Low-Code/No-Code" on September 29th, 2022.

Training XGBoost model with Spacy embeddings after text processing

This workflow was presented at the webinar on September 29th, 2022 by Scott Fincher, Perttu Korhonen and Artem Ryasik.

The Keyword Search and Text analysis dashboard were inspired by this blog post: https://www.knime.com/blog/semantic-keyword-search-for-seo

For Windows users, please follow these instructions to avoid problems when installing the extensions: https://learn.microsoft.com/en-us/windows/win32/fileio/maximum-file-path-limitation?tabs=registry

Workflow annotations: en_core_web_md; Label; Label test records; Unwind embeddings as columns; 80% training, 20% test; Color by relevance class; epochs=3, l_rate=4e-6, max_seq_len=200; 80% training, 20% validation; Create embeddings for training data; Dimensionality reduction; Labelled articles (top), New articles (bottom); 80% training + validation, 20% test; Label new articles; Topic analysis, Co-occurrence, Tag cloud.

Nodes and components in the workflow: CSV Reader, Spacy Model Selector, Concatenate, Number To String, Split Collection Column, Partitioning, Row Splitter, Column Filter, Color Manager, XGBoost Tree Ensemble Learner, XGBoost Predictor, BERT Model Selector, BERT Classification Learner, BERT Predictor, BERT Embedder, t-SNE (L. Jonsson), Keyword Search, Text analysis dashboard, XGBoost model evaluation, BERT model evaluation.
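The "Unwind embeddings as columns" and "80% training, 20% test" steps named in the workflow annotations can be illustrated outside KNIME with a minimal Python sketch. It is not the workflow's implementation: the function names, the `emb_` column prefix, the toy data, and the use of seeded random sampling are all assumptions made for illustration, and the actual settings of the Split Collection Column and Partitioning nodes are not stated on this page.

```python
import random


def unwind_embeddings(rows, column="embedding", prefix="emb_"):
    """Replace a list-valued column with one numeric column per dimension."""
    unwound = []
    for row in rows:
        # Copy every field except the collection column itself.
        flat = {key: value for key, value in row.items() if key != column}
        for i, value in enumerate(row[column]):
            flat[f"{prefix}{i}"] = value
        unwound.append(flat)
    return unwound


def partition(rows, train_fraction=0.8, seed=42):
    """Shuffle rows reproducibly and split them into (train, test)."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# Hypothetical toy data: 10 articles with made-up 3-dimensional embeddings.
articles = [
    {
        "id": i,
        "label": "relevant" if i % 2 == 0 else "irrelevant",
        "embedding": [i * 0.1, i * 0.2, i * 0.3],
    }
    for i in range(10)
]

table = unwind_embeddings(articles)
train, test = partition(table)
print(len(train), len(test))
print(sorted(table[0].keys()))
```

Each resulting row has one plain numeric column per embedding dimension, which is the shape a tree-based learner such as XGBoost expects as input.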
