PandaSQL Join

Uses the Python pandasql package to join two input tables.

The supplied query should reference the top input table as t1 and the lower input table as t2.

t1 is the LEFT (upper input) table in the join
t2 is the RIGHT (lower input) table in the join

e.g.
select * from t1
inner join t2 on t1.mydate between t2.date1 and t2.date2
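The range join above can be tried outside KNIME. pandasql runs its queries on SQLite by default, so the sketch below uses only Python's built-in sqlite3 module to show what the query returns. The table contents and the id and label columns are hypothetical, and dates are assumed to be stored as ISO-formatted text so that BETWEEN compares them correctly.

```python
import sqlite3

# In-memory SQLite database standing in for the two KNIME input tables.
conn = sqlite3.connect(":memory:")
conn.execute("create table t1 (id integer, mydate text)")
conn.execute("create table t2 (label text, date1 text, date2 text)")

# Hypothetical sample rows; dates are ISO strings (YYYY-MM-DD).
conn.executemany(
    "insert into t1 values (?, ?)",
    [(1, "2021-01-15"), (2, "2021-02-20"), (3, "2021-04-01")],
)
conn.executemany(
    "insert into t2 values (?, ?, ?)",
    [("Jan", "2021-01-01", "2021-01-31"), ("Feb", "2021-02-01", "2021-02-28")],
)

# Same join condition as the example query: t1 is the left table, t2 the right.
query = """
select t1.id, t1.mydate, t2.label
from t1
inner join t2 on t1.mydate between t2.date1 and t2.date2
order by t1.id
"""
rows = conn.execute(query).fetchall()
for row in rows:
    print(row)
```

The t1 row dated 2021-04-01 falls inside no t2 range, so the inner join drops it.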


NOTE: to use this component you must have the following installed and configured in your KNIME environment:
Python 3
numpy
pandas
pandasql

e.g. to install the packages manually with pip, outside of KNIME:
python -m pip install numpy
python -m pip install pandas
python -m pip install pandasql
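After installing, one possible way to confirm that the packages are visible to a given Python interpreter is the check below. It is only a sketch: the helper name missing_packages is hypothetical, and it must be run with the same Python environment that KNIME is configured to use.

```python
import importlib.util

# Packages the component description lists as required.
REQUIRED = ["numpy", "pandas", "pandasql"]


def missing_packages(names):
    """Return the names that cannot be located by this interpreter."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_packages(REQUIRED)
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required packages were found.")
```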

27 May 2021 @takbb Brian Bates

Options

SQL Query between t1 (left) and t2 (right)
The query should be written in pandasql SQL syntax and should refer to the top (LEFT) input table as t1 and the bottom (RIGHT) input table as t2.

Input Ports

The left data table to be included in the join
The right data table to be included in the join

Output Ports

The output table produced by joining t1 with t2 using the supplied pandasql query
