Icon

01_​Creating_​A_​Corpus

Creating a Corpus of Documents

This workflow parses the drug list and descriptions from the WHO website and creates a corpus by using the Document Grabber to fetch articles from PubMed. The functionality is bundled into components for better clarity.

Get the drug list Component to get articles from PubMed by using the Document Grabbernode. It automatically iterates over the drug list and catches failing HTTPrequests. This workflow parses a list of drugs from the WHO website and automatically creates a corpus of biomedical documents. Port 1: Drug / ATC CodePort 2: Drug combinationsPort 3: ATC Code descriptionsWrite todata folderWrite drug / ATC codeto data folderWrite ATC code descriptionsto data folder Parse drugs anddescriptions Gathering articles Table Writer Table Writer Table Writer Get the drug list Component to get articles from PubMed by using the Document Grabbernode. It automatically iterates over the drug list and catches failing HTTPrequests. This workflow parses a list of drugs from the WHO website and automatically creates a corpus of biomedical documents. Port 1: Drug / ATC CodePort 2: Drug combinationsPort 3: ATC Code descriptionsWrite todata folderWrite drug / ATC codeto data folderWrite ATC code descriptionsto data folder Parse drugs anddescriptions Gathering articles Table Writer Table Writer Table Writer

Nodes

Extensions

Links