The MergeTwoVCFs node is based on the GATK CombineVariants tool.
It reads in variants records from two separate ROD (Reference-Ordered Data) sources and combines them into a single VCF.
This tool aims to fulfill two main possible use cases:
1.) It combines variant records present at the same site in the different input sources into a single variant record in the output.
2.) It assumes that each ROD source represents the same set of samples (although this is not enforced). It uses the priority list (if provided) to emit a single record instance at every position represented in the input RODs.
This node can for example merge the output VCLs file from two different variant calling tools (e.g. Pindel and GATKHaplotypeCaller).
For further information, see GATK documentation of CombineVariants.
You want to see the source code for this node? Click the following button and we’ll use our super-powers to find it for you.
To use this node in KNIME, install the extension KNIME4NGS from the below update site following our NodePit Product and Node Installation Guide:
Deploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!