Icon

Dataset cleaning and formatting

There has been no title set for this workflow's metadata.

This workflow demonstrates how to use the "String Format Manager" node in conjunction with the "Number Format Manager" and "String Cleaner" nodes. The format manager nodes are in the node repository under the Views category. On the other hand, the "String Cleaner" node can be found in the Manipulation category.

You can easily download and run the workflow directly in your KNIME installation. We recommend that you use the latest version of the KNIME Analytics Platform for optimal performance. It can also be deployed as a Data App in KNIME Business Hub.

The "String Format Manager" and "Number Format Manager" nodes can format strings and numbers for better visualization. The "String Cleaner" node assists in common data-cleaning tasks. Combining these nodes can create a helpful toolbox for plotting datasets on a table view.

We created synthetic data to showcase the node's capabilities. It contains fake camera model information, including descriptions, reviews, and technical characteristics.

The "Table View" provides a clear display of the results of data cleaning and formatting. For instance, the "Price" column shows the removal of decimals and using apostrophes as thousand separators. The camera descriptions are wrapped for convenience, and the fake links are now clickable. Additionally, we have standardized the number of decimal places to four for all columns with technical information about the cameras.

URL: Synthetic Data Wikipedia https://en.wikipedia.org/wiki/Synthetic_data

Nodes

Extensions

Links