
05_Data_Manipulation_Using_Pandas_and_PyArrow

Data Manipulation Using Pandas and PyArrow

This workflow demonstrates how to use the Python packages pandas and PyArrow for data manipulation. In the first branch of the workflow, the Python Script node reads the KNIME table as a pandas DataFrame; in the second branch, the Python Script node reads it as a PyArrow table.

In each branch, the data is then normalised using the commands of the respective Python package.

Adopting Python Bundled Packages: Download and open this workflow to auto-install requirements and execute the Python scripts. This workflow uses a node from the KNIME Python Integration, which installs Python and a number of pre-defined packages on your system. Supported on KNIME Analytics Platform 4.7 or higher.

Configure the Python Script node for data manipulation

The Python Script node can be provided with a table or an object as an input port, and it can output a table, an object or an image, depending on the ports the user selects.

Step 1: Drag the Python Script node to KNIME Analytics Platform.
Step 2: Click on the three dots on the node to add an output table port.
Step 3: Write your script in the "Script" tab and import the pandas or pyarrow package to perform data manipulation. Press "execute script" to check if the execution was successful.

Workflow nodes:
Table Reader: Airline Data
Python Script: perform data manipulation using PyArrow package
Python Script: perform data manipulation using Pandas package
Math Formula: downstream task
Math Formula: downstream task
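The script written in Step 3 could look roughly like the following sketch for the pandas branch. It assumes the `knime.scripting.io` module that the KNIME Python Integration provides inside the Python Script node. The helper `normalise_numeric` is a hypothetical name, and min-max scaling is an assumed choice of normalisation, not necessarily what the workflow does. The import is guarded so that the helper can also be tried outside KNIME.

```python
import pandas as pd


def normalise_numeric(df: pd.DataFrame) -> pd.DataFrame:
    """Min-max scale every numeric column; leave other columns untouched.

    Hypothetical helper for illustration. A constant column yields NaN
    because its value range is zero.
    """
    out = df.copy()
    for name in out.select_dtypes(include="number").columns:
        col = out[name]
        out[name] = (col - col.min()) / (col.max() - col.min())
    return out


try:
    # Only available inside the KNIME Python Script node.
    import knime.scripting.io as knio
except ImportError:
    knio = None

if knio is not None:
    # Read the first input port as a pandas DataFrame.
    df = knio.input_tables[0].to_pandas()
    # Write the result to the output table port added in Step 2.
    knio.output_tables[0] = knio.Table.from_pandas(normalise_numeric(df))
    # The PyArrow branch would use knio.input_tables[0].to_pyarrow()
    # and knio.Table.from_pyarrow(...) instead.
```

Keeping the transformation in a plain function separates the data manipulation from the node's input and output handling, so the same logic can be checked on a small DataFrame before it is pasted into the "Script" tab.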
