Icon

00a-KNIME_​Introduction

KNIME Introduction Workflow IThese workflows show some of KNIME's features, using default nodes only.Follow the numbers in the boxes for this little introduction. 1) Text in these boxes are workflowannotations. They can be used to structurethe workflows or, like here, show someinformation to a user. 3) To configure a node, either double click it,press F6 while it is selected or right click andselect "configure...". To execute it, select the node and press F7 orright click and select "execute". All prior,connected nodes will be executed as well.To look at the output of a node, right click itand have a look at the bottom options of thecontext menu. For each output port there isone "looking glass and table" icons, if you clickthis, you see the current data at this port. 2) Data in KNIME flows from the left to theright, going through nodes, which areconnected by ports. Nodes have input andoutput ports. If you seelct a node, you willsee information about it on the right in the"Node Description" window. All availablenodes are in the "Node repository" in thebottom left and can be dragged into theworkflow using the mouse. 4) Now, execute the data Generator node and have alook at the first output port/table. during execution yousee that the little traffic light below the node switchescolor from yellow (seems ok and can be executed) togreen (successfully) executed.You see a table with four columns, three with numbers(and integer type, little I in the column header), andone text column (S for string in the header). A table isthe default data format for a KNIME port and isindicated by the black triangle.You cannot manipulate the data in the table, but cansort it by and search for results.If you are done with the inspection, close the window. 5) Let's visualize our data. We start with a box plot of our data. Ifyou look at the configuration you see that the node will take thedata from each numeric column and create an appropriate boxplot. To inspect it, right click the node and select "Interactive view:box plot". A view is a special inspection method, provided bymany nodes, which allows a data-dependend inspection of thedata.Alternatively, you can select the view of the output port, which isan image (hence a green square, not the black triangle for atable). 6) Let's make a more interesting plot: a conditional box plot.This one first clusters our data according to some category (thecluster name in this example) and create box plots of the output.After inspecting the settings and running the node, open theinteractive view. Here you see two plots: one for each cluster. Asyou see in the y-axis, data of one column only is shown. To inspectanother column, select the little menu icon in the top-rightcorner and change the selected column. 7) Finally, create a scatter plot of our data. But first we want to colorize the data according to tocluster. For this, the Color Manager can be used. If you look at the configuration, you see thatit can use nominal values (like the cluster names) with discrete colors, but also numericalvalues, using the ranges for color gradients.Run the color manager and look at the results table: each row has a little color icon. This colorwill be used in any succeeding plot nodes, like the appended Scatter Plot. After checking thesettings and executing the Scatter Plot node, you can open the interactive view, where you willsee your colored data points. In the little menu, you can switch which data is shown on the x-and y-axis.If you switch from moving the view (grab the panel and move your mouse, use the wheel forzoom) to selecting (the little check symbol), you can select data points by selecting them withboxes drawn with your mouse. Select some of the data points, select "apply" at the bottom andclose the window. If you look at the results table of the node, you can find your selected datapoints using the column "selected" (sort it descending, to quickly find them). This informationcan be used by any succeeding nodes as well, allowing you interactive data selection. make a scatter plotof our datagenerate two randomdata clusters inthree dimensionscolor the clustersBox plots ofthe dimensionsbox plot ofthe clusterScatter Plot Data Generator Color Manager Box Plot ConditionalBox Plot KNIME Introduction Workflow IThese workflows show some of KNIME's features, using default nodes only.Follow the numbers in the boxes for this little introduction. 1) Text in these boxes are workflowannotations. They can be used to structurethe workflows or, like here, show someinformation to a user. 3) To configure a node, either double click it,press F6 while it is selected or right click andselect "configure...". To execute it, select the node and press F7 orright click and select "execute". All prior,connected nodes will be executed as well.To look at the output of a node, right click itand have a look at the bottom options of thecontext menu. For each output port there isone "looking glass and table" icons, if you clickthis, you see the current data at this port. 2) Data in KNIME flows from the left to theright, going through nodes, which areconnected by ports. Nodes have input andoutput ports. If you seelct a node, you willsee information about it on the right in the"Node Description" window. All availablenodes are in the "Node repository" in thebottom left and can be dragged into theworkflow using the mouse. 4) Now, execute the data Generator node and have alook at the first output port/table. during execution yousee that the little traffic light below the node switchescolor from yellow (seems ok and can be executed) togreen (successfully) executed.You see a table with four columns, three with numbers(and integer type, little I in the column header), andone text column (S for string in the header). A table isthe default data format for a KNIME port and isindicated by the black triangle.You cannot manipulate the data in the table, but cansort it by and search for results.If you are done with the inspection, close the window. 5) Let's visualize our data. We start with a box plot of our data. Ifyou look at the configuration you see that the node will take thedata from each numeric column and create an appropriate boxplot. To inspect it, right click the node and select "Interactive view:box plot". A view is a special inspection method, provided bymany nodes, which allows a data-dependend inspection of thedata.Alternatively, you can select the view of the output port, which isan image (hence a green square, not the black triangle for atable). 6) Let's make a more interesting plot: a conditional box plot.This one first clusters our data according to some category (thecluster name in this example) and create box plots of the output.After inspecting the settings and running the node, open theinteractive view. Here you see two plots: one for each cluster. Asyou see in the y-axis, data of one column only is shown. To inspectanother column, select the little menu icon in the top-rightcorner and change the selected column. 7) Finally, create a scatter plot of our data. But first we want to colorize the data according to tocluster. For this, the Color Manager can be used. If you look at the configuration, you see thatit can use nominal values (like the cluster names) with discrete colors, but also numericalvalues, using the ranges for color gradients.Run the color manager and look at the results table: each row has a little color icon. This colorwill be used in any succeeding plot nodes, like the appended Scatter Plot. After checking thesettings and executing the Scatter Plot node, you can open the interactive view, where you willsee your colored data points. In the little menu, you can switch which data is shown on the x-and y-axis.If you switch from moving the view (grab the panel and move your mouse, use the wheel forzoom) to selecting (the little check symbol), you can select data points by selecting them withboxes drawn with your mouse. Select some of the data points, select "apply" at the bottom andclose the window. If you look at the results table of the node, you can find your selected datapoints using the column "selected" (sort it descending, to quickly find them). This informationcan be used by any succeeding nodes as well, allowing you interactive data selection. make a scatter plotof our datagenerate two randomdata clusters inthree dimensionscolor the clustersBox plots ofthe dimensionsbox plot ofthe clusterScatter Plot Data Generator Color Manager Box Plot ConditionalBox Plot

Nodes

Extensions

Links