A component to download a set of FASTQ files under a certain project/study/experment by providing an accession ID from the European Nucleotide Archive (ENA). The component works by first getting a summary table of samples belonging to the provided accession number. In this summary table are paths (ftp) to zipped FASTQ files of individual samples.
Using these paths, zipped FASTQ files are downloaded and stored under a directory named data/
Supported accession types are Projects, Studies, BioSamples, Samples, Experiments, Runs and Analyses. Refer to https://ena-docs.readthedocs.io/en/latest/submit/general-guide/accessions.html to see details. The component is able to handle both single and paired library layouts in a seamless fashion.
Note:
Files will not be downloaded if their up-to-date version already exists under data/
To use this component in KNIME, download it from the below URL and open it in KNIME:
Download ComponentDeploy, schedule, execute, and monitor your KNIME workflows locally, in the cloud or on-premises – with our brand new NodePit Runner.
Try NodePit Runner!