Icon

_​knime_​performance_​and_​tuning

A meta collection about KNIME and performance and performance tuning and some problems

A meta collection about KNIME and performance and performance tuning and some problems

Please be aware: there might not be the one perfect setting for every task and system. Read the entries and find the ones that fit your needs. And be aware: often there is no substituting good setup and strong hardware performance with super clever settings - having said that there are some things you *could* check out:

- overall performance of your system (allocated RAM, CPU, Disk). Give KNIME what you can but leave some for your system (cf. links)
- permissions: do you (and KNIME) have full access to all files (there was an issue with MacOS)
- virus scanner - KNIME has a lot of small files, make sure no virus scanner is agressivly blocking them
- settings about data storage and Java heap space (and temporary folders)
- restart your KNIME once with a -clean option in the knime.ini (cf. links)
- maybe do a clean new installation in a new folder (if on windows maybe just un-zip the zip version into a clean folder)
- check your storage/disk - be aware of possible issues with OneDrive [special characters like # and "(" ")" must be allowed/supported]
- think about a suitable backup strategy for your work. Things can go wrong. You might want to revert to an earlier version of your work
- research this collection or on the KNIME forum (forum.knime.com)

Start with the official blog about KNIME performance
https://www.knime.com/blog/optimizing-knime-workflows-for-performance
(yes in theory such a blog might cover it all but there is more to a large system like KNIME)

Check your installation (on a Windows machine use the .ZIP file if in doubt) - share debug log with the community or support
https://forum.knime.com/t/multiple-problems-have-occurred-message-pop-up-on-knime-startup/38655/2?u=mlauber71

Long entry and thread about performance and KNIME "Process 900+ CSV files"
https://forum.knime.com/t/processing-hundreds-of-millions-of-records/13593/5?u=mlauber71

Additional thread about performance and collection of links
https://forum.knime.com/t/large-data-tables-missing/13108/2?u=mlauber71

If you are stuck and KNIME is not responding or behaving strangely try a clean restart *once*
https://forum.knime.com/t/knime-update-next-button-is-not-moving/15539/4?u=mlauber71

Write table to disk and check Java heap space as options to deal with performance issues
https://forum.knime.com/t/knime-3-6-crash-when-dealing-with-massive-data/12145/2?u=mlauber71

Check your virus scanner if it hinders performance
https://forum.knime.com/t/knime-takes-long-to-load/12669/4?u=mlauber71

Think about using the Cache node at certain points and you might run the Java Garbage Collector to free memory
https://forum.knime.com/t/execute-failed-cannot-read-file-knime-container-20200301-1905559478659870280-bin-snappy/21461/19?u=mlauber71
https://hub.knime.com/search?q=garbage&type=Node

Also sometimes there are problems with these 'snappy' files. Switching the internal storage option might help
https://forum.knime.com/t/blob-and-buffer-errors/25915/2?u=mlauber71

Tweak the internal storage (format) of KNIME workflows
https://forum.knime.com/t/knime-is-slowing-down/12619/2?u=mlauber71

You can also try out the new "Columnar Table Backend" (based on Apache Arrow) for internal storage
https://www.knime.com/blog/improved-performance-with-new-table-backend?u=mlauber71


To speed up large data processing think about 'streaming' or using Parquet or ORC files (partitioned files)
https://forum.knime.com/t/problem-with-disk-space-during-workflow-execution/48435/6?u=mlauber71

Find the KNIME temporary files and folders
https://forum.knime.com/t/question-about-memory-policy-in-node-options/36839/2?u=mlauber71

Be aware that some configurations of MS OneDrive might not support all the spacial chacaters like hash (#) that KNIME workflows need
https://forum.knime.com/t/can-not-import-workflow/11115/4?u=mlauber71
https://forum.knime.com/t/can-not-open-knime-workflows-nor-create-new-workflows-after-migration-to-a-new-drive/19058/4?u=mlauber71

On MacOS make sure that KNIME has all the necessary access rights
https://forum.knime.com/t/read-permission-error-in-csv-reader/20879/24?u=mlauber71
https://forum.knime.com/t/macos-catalina-cannot-read-from-file-system/24472/3?u=mlauber71
=> that is also true for other systems of course

Column rename node is not good to change the column type (sorry)
https://forum.knime.com/t/knime-nodes-corrupting-on-multiple-computers/13277/7?u=mlauber71

KNIME and Spark 1
https://forum.knime.com/t/how-to-speed-up-the-spark-to-datebase-node/13288/2?u=mlauber71

KNIME and Spark 2
https://forum.knime.com/t/spark-context-how-to-cache-the-intermediate-data-and-not-write-it-back-while-doing-a-series-of-transformations-on-the-data/20881/4?u=mlauber71

Streaming data in KNIME
https://www.knime.com/blog/streaming-data-in-knime

Save and don't save data in a workflow
https://forum.knime.com/t/save-workflow-without-data/27843/2?u=mlauber71

Create your own backup system with KNIME and ZIP
https://forum.knime.com/t/copy-move-files-node/29412/15?u=mlauber71

KNIME and Backup - think about protecting your work(flows)
https://forum.knime.com/t/restore-knime-workflows/36812/2?u=mlauber71

Create path variables in KNIME
https://forum.knime.com/t/write-multiple-csv-file-to-a-loop/44989/2?u=mlauber71

A meta collection about KNIME and performance and performance tuning and some problemsPlease be aware: there might not be the one perfect setting for every task and system. Read the entries and find the ones that fit your needs. And be aware: often there is no substituting good setup andstrong hardware performance with super clever settings - having said that there are some things you *could* check out:- overall performance of your system (allocated RAM, CPU, Disk). Give KNIME what you can but leave some for your system (cf. links)- permissions: do you (and KNIME) have full access to all files (there was an issue with MacOS)- virus scanner - KNIME has a lot of small files, make sure no virus scanner is agressivly blocking them- settings about data storage and Java heap space (and temporary folders)- restart your KNIME once with a -clean option in the knime.ini (cf. links)- maybe do a clean new installation in a new folder (if on windows maybe just un-zip the zip version into a clean folder)- check your storage/disk - be aware of possible issues with OneDrive [special characters like # and "("")" must be allowed/supported]- think about a suitable backup strategy for your work. Things can go wrong. You might want to revert to an earlier version of your work- research this collection or on the KNIME forum (forum.knime.com)Start with the official blog about KNIME performance https://www.knime.com/blog/optimizing-knime-workflows-for-performance (yes in theory such a blog might cover it all but there is more to a large system like KNIME)Check your installation (on a Windows machine use the .ZIP file if in doubt) - share debug log with the community or supporthttps://forum.knime.com/t/multiple-problems-have-occurred-message-pop-up-on-knime-startup/38655/2?u=mlauber71Long entry and thread about performance and KNIME "Process 900+ CSV files"https://forum.knime.com/t/processing-hundreds-of-millions-of-records/13593/5?u=mlauber71 Additional thread about performance and collection of linkshttps://forum.knime.com/t/large-data-tables-missing/13108/2?u=mlauber71 If you are stuck and KNIME is not responding or behaving strangely try a clean restart *once*https://forum.knime.com/t/knime-update-next-button-is-not-moving/15539/4?u=mlauber71Write table to disk and check Java heap space as options to deal with performance issueshttps://forum.knime.com/t/knime-3-6-crash-when-dealing-with-massive-data/12145/2?u=mlauber71Check your virus scanner if it hinders performancehttps://forum.knime.com/t/knime-takes-long-to-load/12669/4?u=mlauber71Think about using the Cache node at certain points and you might run the Java Garbage Collector to free memoryhttps://forum.knime.com/t/execute-failed-cannot-read-file-knime-container-20200301-1905559478659870280-bin-snappy/21461/19?u=mlauber71https://hub.knime.com/search?q=garbage&type=NodeAlso sometimes there are problems with these 'snappy' files. Switching the internal storage option might helphttps://forum.knime.com/t/blob-and-buffer-errors/25915/2?u=mlauber71Tweak the internal storage (format) of KNIME workflowshttps://forum.knime.com/t/knime-is-slowing-down/12619/2?u=mlauber71You can also try out the new "Columnar Table Backend" (based on Apache Arrow) for internal storagehttps://www.knime.com/blog/improved-performance-with-new-table-backend?u=mlauber71To speed up large data processing think about 'streaming' or using Parquet or ORC files (partitioned files)https://forum.knime.com/t/problem-with-disk-space-during-workflow-execution/48435/6?u=mlauber71Find the KNIME temporary files and foldershttps://forum.knime.com/t/question-about-memory-policy-in-node-options/36839/2?u=mlauber71Be aware that some configurations of MS OneDrive might not support all the spacial chacaters like hash (#) that KNIME workflows needhttps://forum.knime.com/t/can-not-import-workflow/11115/4?u=mlauber71https://forum.knime.com/t/can-not-open-knime-workflows-nor-create-new-workflows-after-migration-to-a-new-drive/19058/4?u=mlauber71On MacOS make sure that KNIME has all the necessary access rightshttps://forum.knime.com/t/read-permission-error-in-csv-reader/20879/24?u=mlauber71https://forum.knime.com/t/macos-catalina-cannot-read-from-file-system/24472/3?u=mlauber71=> that is also true for other systems of courseColumn rename node is not good to change the column type (sorry)https://forum.knime.com/t/knime-nodes-corrupting-on-multiple-computers/13277/7?u=mlauber71KNIME and Spark 1https://forum.knime.com/t/how-to-speed-up-the-spark-to-datebase-node/13288/2?u=mlauber71KNIME and Spark 2https://forum.knime.com/t/spark-context-how-to-cache-the-intermediate-data-and-not-write-it-back-while-doing-a-series-of-transformations-on-the-data/20881/4?u=mlauber71Streaming data in KNIMEhttps://www.knime.com/blog/streaming-data-in-knimeSave and don't save data in a workflowhttps://forum.knime.com/t/save-workflow-without-data/27843/2?u=mlauber71Create your own backup system with KNIME and ZIPhttps://forum.knime.com/t/copy-move-files-node/29412/15?u=mlauber71KNIME and Backup - think about protecting your work(flows)https://forum.knime.com/t/restore-knime-workflows/36812/2?u=mlauber71Create path variables in KNIMEhttps://forum.knime.com/t/write-multiple-csv-file-to-a-loop/44989/2?u=mlauber71 Mastering KNIME: Unlocking Peak Performance with Expert Tips and Smart Settingshttps://medium.com/p/49e2c0dab9cc A meta collection about KNIME and performance and performance tuning and some problemsPlease be aware: there might not be the one perfect setting for every task and system. Read the entries and find the ones that fit your needs. And be aware: often there is no substituting good setup andstrong hardware performance with super clever settings - having said that there are some things you *could* check out:- overall performance of your system (allocated RAM, CPU, Disk). Give KNIME what you can but leave some for your system (cf. links)- permissions: do you (and KNIME) have full access to all files (there was an issue with MacOS)- virus scanner - KNIME has a lot of small files, make sure no virus scanner is agressivly blocking them- settings about data storage and Java heap space (and temporary folders)- restart your KNIME once with a -clean option in the knime.ini (cf. links)- maybe do a clean new installation in a new folder (if on windows maybe just un-zip the zip version into a clean folder)- check your storage/disk - be aware of possible issues with OneDrive [special characters like # and "("")" must be allowed/supported]- think about a suitable backup strategy for your work. Things can go wrong. You might want to revert to an earlier version of your work- research this collection or on the KNIME forum (forum.knime.com)Start with the official blog about KNIME performance https://www.knime.com/blog/optimizing-knime-workflows-for-performance (yes in theory such a blog might cover it all but there is more to a large system like KNIME)Check your installation (on a Windows machine use the .ZIP file if in doubt) - share debug log with the community or supporthttps://forum.knime.com/t/multiple-problems-have-occurred-message-pop-up-on-knime-startup/38655/2?u=mlauber71Long entry and thread about performance and KNIME "Process 900+ CSV files"https://forum.knime.com/t/processing-hundreds-of-millions-of-records/13593/5?u=mlauber71 Additional thread about performance and collection of linkshttps://forum.knime.com/t/large-data-tables-missing/13108/2?u=mlauber71 If you are stuck and KNIME is not responding or behaving strangely try a clean restart *once*https://forum.knime.com/t/knime-update-next-button-is-not-moving/15539/4?u=mlauber71Write table to disk and check Java heap space as options to deal with performance issueshttps://forum.knime.com/t/knime-3-6-crash-when-dealing-with-massive-data/12145/2?u=mlauber71Check your virus scanner if it hinders performancehttps://forum.knime.com/t/knime-takes-long-to-load/12669/4?u=mlauber71Think about using the Cache node at certain points and you might run the Java Garbage Collector to free memoryhttps://forum.knime.com/t/execute-failed-cannot-read-file-knime-container-20200301-1905559478659870280-bin-snappy/21461/19?u=mlauber71https://hub.knime.com/search?q=garbage&type=NodeAlso sometimes there are problems with these 'snappy' files. Switching the internal storage option might helphttps://forum.knime.com/t/blob-and-buffer-errors/25915/2?u=mlauber71Tweak the internal storage (format) of KNIME workflowshttps://forum.knime.com/t/knime-is-slowing-down/12619/2?u=mlauber71You can also try out the new "Columnar Table Backend" (based on Apache Arrow) for internal storagehttps://www.knime.com/blog/improved-performance-with-new-table-backend?u=mlauber71To speed up large data processing think about 'streaming' or using Parquet or ORC files (partitioned files)https://forum.knime.com/t/problem-with-disk-space-during-workflow-execution/48435/6?u=mlauber71Find the KNIME temporary files and foldershttps://forum.knime.com/t/question-about-memory-policy-in-node-options/36839/2?u=mlauber71Be aware that some configurations of MS OneDrive might not support all the spacial chacaters like hash (#) that KNIME workflows needhttps://forum.knime.com/t/can-not-import-workflow/11115/4?u=mlauber71https://forum.knime.com/t/can-not-open-knime-workflows-nor-create-new-workflows-after-migration-to-a-new-drive/19058/4?u=mlauber71On MacOS make sure that KNIME has all the necessary access rightshttps://forum.knime.com/t/read-permission-error-in-csv-reader/20879/24?u=mlauber71https://forum.knime.com/t/macos-catalina-cannot-read-from-file-system/24472/3?u=mlauber71=> that is also true for other systems of courseColumn rename node is not good to change the column type (sorry)https://forum.knime.com/t/knime-nodes-corrupting-on-multiple-computers/13277/7?u=mlauber71KNIME and Spark 1https://forum.knime.com/t/how-to-speed-up-the-spark-to-datebase-node/13288/2?u=mlauber71KNIME and Spark 2https://forum.knime.com/t/spark-context-how-to-cache-the-intermediate-data-and-not-write-it-back-while-doing-a-series-of-transformations-on-the-data/20881/4?u=mlauber71Streaming data in KNIMEhttps://www.knime.com/blog/streaming-data-in-knimeSave and don't save data in a workflowhttps://forum.knime.com/t/save-workflow-without-data/27843/2?u=mlauber71Create your own backup system with KNIME and ZIPhttps://forum.knime.com/t/copy-move-files-node/29412/15?u=mlauber71KNIME and Backup - think about protecting your work(flows)https://forum.knime.com/t/restore-knime-workflows/36812/2?u=mlauber71Create path variables in KNIMEhttps://forum.knime.com/t/write-multiple-csv-file-to-a-loop/44989/2?u=mlauber71 Mastering KNIME: Unlocking Peak Performance with Expert Tips and Smart Settingshttps://medium.com/p/49e2c0dab9cc

Nodes

  • No nodes found

Extensions

  • No modules found

Links