The “Location Extractor” node extracts geographic locations from unstructured English text. It uses Palladian’s location extraction mechanism.
The location extraction algorithm first performs several steps to recognize potential locations within a given text, and then disambiguates them. The disambiguation step checks hierarchical (containment) relations and identifies the correct locations by their proximity to other locations mentioned in the text.
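The proximity idea behind the disambiguation step can be illustrated with a minimal sketch. This is not Palladian's actual algorithm: the function names, the candidate records, and the simple "nearest anchor wins" rule are assumptions made for illustration only.

```python
from math import asin, cos, radians, sin, sqrt

EARTH_RADIUS_KM = 6371.0  # mean Earth radius


def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) pairs in decimal degrees."""
    lat1, lon1 = map(radians, a)
    lat2, lon2 = map(radians, b)
    h = sin((lat2 - lat1) / 2) ** 2 + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2
    return 2 * EARTH_RADIUS_KM * asin(sqrt(h))


def disambiguate(candidates, anchors):
    """Pick the candidate that lies closest to any already-resolved anchor location.

    Hypothetical helper: a real extractor would combine this with hierarchy
    checks, population figures, and other evidence.
    """
    return min(
        candidates,
        key=lambda c: min(haversine_km(c["coords"], anchor) for anchor in anchors),
    )


# Hypothetical gazetteer candidates for the ambiguous name "Paris".
paris_candidates = [
    {"name": "Paris, France", "coords": (48.8566, 2.3522)},
    {"name": "Paris, Texas", "coords": (33.6609, -95.5555)},
]

# "Dallas" was mentioned in the same text and resolved unambiguously.
anchors = [(32.7767, -96.7970)]

best = disambiguate(paris_candidates, anchors)
print(best["name"])
```

In this sketch, a text that also mentions Dallas pulls the ambiguous "Paris" towards the Texan candidate, because that candidate is far closer to the anchor than the French capital.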
Each identified location in the text is returned; a location that occurs several times is returned once per occurrence. Extracted locations are classified into the following categories:
For each location, geographical coordinates with longitude and latitude values are provided. They are in WGS84 decimal degrees.
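As a small illustration of the coordinate format, the following sketch checks that a latitude/longitude pair is within the valid decimal-degree ranges and converts a value to degrees, minutes, and seconds for display. The helper names are hypothetical and not part of the node or of Palladian.

```python
def is_valid_wgs84(lat, lon):
    """Return True if the pair lies within the valid decimal-degree ranges."""
    return -90.0 <= lat <= 90.0 and -180.0 <= lon <= 180.0


def to_dms(value, is_latitude):
    """Format a decimal-degree value as degrees, minutes, seconds with hemisphere."""
    if is_latitude:
        hemisphere = "N" if value >= 0 else "S"
    else:
        hemisphere = "E" if value >= 0 else "W"
    # Work in whole seconds to avoid carrying rounding errors between fields.
    total_seconds = round(abs(value) * 3600)
    degrees, remainder = divmod(total_seconds, 3600)
    minutes, seconds = divmod(remainder, 60)
    return f"{degrees}°{minutes:02d}'{seconds:02d}\"{hemisphere}"


print(is_valid_wgs84(51.5, -0.25))
print(to_dms(51.5, is_latitude=True), to_dms(-0.25, is_latitude=False))
```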
To use the “Location Extractor”, a “Location Source” (also known as a gazetteer) must be configured. The Location Source provides a database of real-world locations and meta information such as alternative names, population figures, coordinates, and hierarchical relations. You can select and configure Location Sources in the KNIME Preferences under KNIME → Palladian → Location Extractor.
There are two Location Sources available:
GeoNames: We currently provide a freely usable Location Source for the GeoNames API. This Location Source allows 30,000 REST requests/day and 2,000 REST requests/hour. To add the GeoNames source, click the “New…” button, then follow the instructions and the link to create a free GeoNames account. We suggest enabling the option to retrieve location hierarchies, as this improves the Location Extractor’s results. Note, however, that it causes an additional API request for every location found.
Local Gazetteer: If you want to keep your data private, are running into the GeoNames request limit, or want to speed up operations significantly, we provide a separate plugin that lets you set up a local gazetteer on your machine without accessing the Web. Contact us at email@example.com if you are interested.
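To judge whether the GeoNames limits are sufficient for a workload, a rough request estimate can help. The sketch below uses the limits stated above and assumes one extra request per found location when hierarchy retrieval is enabled. The number of base lookup requests is a hypothetical input, since how many lookups the extractor issues per text is not specified here.

```python
DAILY_LIMIT = 30_000   # requests per day, as stated above
HOURLY_LIMIT = 2_000   # requests per hour, as stated above


def estimated_requests(lookup_requests, found_locations, with_hierarchy):
    """Estimate total API requests for one run.

    Assumption: hierarchy retrieval adds exactly one request per found location.
    """
    extra = found_locations if with_hierarchy else 0
    return lookup_requests + extra


def fits_limits(requests_per_hour, hours_per_day):
    """Check an evenly spread workload against both limits."""
    within_hour = requests_per_hour <= HOURLY_LIMIT
    within_day = requests_per_hour * hours_per_day <= DAILY_LIMIT
    return within_hour and within_day


# Hypothetical workload: 1,500 lookups per hour that yield 600 locations.
with_hierarchy = estimated_requests(1_500, 600, with_hierarchy=True)
without_hierarchy = estimated_requests(1_500, 600, with_hierarchy=False)
print(with_hierarchy, fits_limits(with_hierarchy, hours_per_day=8))
print(without_hierarchy, fits_limits(without_hierarchy, hours_per_day=8))
```

If the estimate regularly exceeds either limit, disabling hierarchy retrieval or switching to the local gazetteer are the two options described above.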