0 ×

01_​Creating_​A_​Corpus

Workflow

Creating a Corpus of Documents

This workflow parses the drug list and descriptions from the WHO website and creates a corpus by using the Document Grabber to fetch articles from PubMed. The functionality is bundled into components for better clarity.

WHODocument Grabberdrug predictionText mining
Get the drug list Component to get articles from PubMed by using the Document Grabbernode. It automatically iterates over the drug list and catches failing HTTPrequests. This workflow parses a list of drugs from the WHO website and automatically creates a corpus of biomedical documents. Port 1: Drug / ATC CodePort 2: Drug combinationsPort 3: ATC Code descriptionsWrite todata folderWrite drug / ATC codeto data folderWrite ATC code descriptionsto data folder Parse drugs anddescriptions Gathering articles Table Writer Table Writer Table Writer Get the drug list Component to get articles from PubMed by using the Document Grabbernode. It automatically iterates over the drug list and catches failing HTTPrequests. This workflow parses a list of drugs from the WHO website and automatically creates a corpus of biomedical documents. Port 1: Drug / ATC CodePort 2: Drug combinationsPort 3: ATC Code descriptionsWrite todata folderWrite drug / ATC codeto data folderWrite ATC code descriptionsto data folder Parse drugs anddescriptions Gathering articles Table Writer Table Writer Table Writer

Download

Get this workflow from the following link: Download

Resources

Nodes

01_​Creating_​A_​Corpus consists of the following 69 nodes(s):

Plugins

01_​Creating_​A_​Corpus contains nodes provided by the following 5 plugin(s):