Icon

Thesis 2

Db preparation

Text mining

1. Data Preparation for Topic Models. Preprocessing, n-grams, exclusion of reviews with a small number of terms can be adjusted as desired

3. Obtain topic solution. Users can test more than 1 topic solution and choose based on interpretability.

2. Find optimal k. Other methods can be implemented in KNIME (https://hub.knime.com/angusveitch/spaces/Public/latest/TopicKR~HRMp6v9Ip_ODMIob). Other Topic model algorithms that can be used in R or python are structural topic models (STM) and correlated topic models (CTM).
Table Creator
Excel Writer
1. Doc Creation
Column Filter
Column Resorter
Column Renamer
Column Filter
5. Filter Reviews with Less than 10 words
Bar Chart
Papers per topic
GroupBy
Math Formula
Column Appender
In a range of topicsidentify elbow range in image
CHI-Square
Excel Writer
Column Appender
bi-grams (tri-grams can be added as well)
4. N-grams
Row Filter
Excel Reader
GroupBy
GroupBy
Row Filter
String Manipulation
Excel Writer
Row Filter
Reference Row Filter
IEEE explorer
Excel Reader
Scopus
Excel Reader
Row Filter
Row Filter
Row Filter
Column Filter
Column Filter
Term frequency
Table to JSON
Column Renamer
Sorter
Column Filter
Missing Value
Column Filter
doc lenght
Table to JSON
Pivot
phi
Table to JSON
replace withtopic names
Table Creator
String Replacer (Dictionary)
Color Manager
theta
Table to JSON
Column Filter
Column Renamer
Bar Chart
Column Renamer
In a range of topicsidentify elbow range in image
Perplexity index
Step 3: Select a topic #example K=4
Topic Extractor (Parallel LDA)
2. Preprocessing
Joiner
Tag Cloud
Excel Writer
3. Token filter
Summary words per topic
GroupBy
Missing Value
GroupBy
Joiner
Row Filter
Column Renamer
GroupBy
Excel Reader
Column Renamer
Concatenate
Duplicate Row Filter
Column Filter
Column Aggregator
Column Filter
Column Renamer
Excel Reader
Column Aggregator
GroupBy
Column Filter
Vocab
Table to JSON
GroupBy
String Manipulation
Joiner
String Manipulation
Source da tenere
Table Creator
Visualize
Generic JavaScript View (JavaScript) (legacy)
Reference Row Filter
String to Term
Table Row to Variable (deprecated)
Create visualizationas HTML
Python Script (1⇒1) (deprecated)
Column Filter
Concatenate
Create visualizationas HTML
Python Script (1⇒1) (deprecated)
Column Resorter
Reference Row Filter
Color Manager
GroupBy
JSON Reader
Source da togliere
Table Creator
Rule Engine
CSV Writer
Row Filter
CSV Writer
Column Resorter
Excel Writer
Math Formula
Concatenate
gemini phi
Column Resorter
Column Resorter
Excel Reader
CSV Writer
Column Appender
CSV Writer
Duplicate Row Filter
vocab freq gemini
Column Renamer
Python Script
doc.length gemini
Column Renamer

Nodes

Extensions

Links