Parsing Geo Data from text documents

I currently have a Python script that watches a directory, and every time a new file shows up, it calls Apache Tika to parse the file. I am parsing a WIDE variety of document types, and the script posts the parsed data to the appropriate type in an Elasticsearch index.
My question is:
Is there a way for Elasticsearch to recognize various geo-coordinates in the parsed text of the various document types and possibly pin them automatically on the Kibana map?

If the coordinates are in a specific field and you use a template/mapping for that field, then yes.
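For example (a minimal sketch assuming Elasticsearch 7.x, a local node on localhost:9200, and hypothetical index/field names), an explicit geo_point mapping is what lets Kibana plot the points:

```python
import requests

# Create the index with an explicit geo_point mapping for the coordinate field.
requests.put(
    "http://localhost:9200/documents",
    json={
        "mappings": {
            "properties": {
                "content": {"type": "text"},
                "location": {"type": "geo_point"},
            }
        }
    },
)

# Any document indexed with a "lat,lon" string (or a {"lat": ..., "lon": ...}
# object) in that field can then be pinned on a Kibana map.
requests.post(
    "http://localhost:9200/documents/_doc",
    json={"content": "parsed text from Tika", "location": "40.7128,-74.0060"},
)
```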

What if they're not? Is there a plugin or anything that could search all of the parsed text for strings that look like geo-coordinates and tag them in a way that they can be recognized as such?

You could try https://github.com/spinscale/elasticsearch-ingest-opennlp

I read the documentation for the repo you linked, but I'm confused as to what exactly it does.

It uses natural language processing to do entity extraction; an entity in this instance could be a location.
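Roughly, you install the plugin and point an ingest pipeline at the field that holds the parsed text. The sketch below follows the plugin's README (an "opennlp" processor with a "field" option), so the exact options may differ between plugin versions, and the index/field names are just placeholders:

```python
import requests

# Define an ingest pipeline that runs OpenNLP named-entity extraction
# over the field containing the parsed text.
requests.put(
    "http://localhost:9200/_ingest/pipeline/opennlp-pipeline",
    json={
        "description": "Extract named entities (names, dates, locations) from parsed text",
        "processors": [{"opennlp": {"field": "content"}}],
    },
)

# Index documents through the pipeline; the extracted entities (including any
# locations it finds) are added to the document as additional fields.
requests.post(
    "http://localhost:9200/documents/_doc?pipeline=opennlp-pipeline",
    json={"content": "The shipment left Rotterdam for New York on Monday."},
)
```

Note that the extracted locations are place names rather than coordinates, so you would still need a geocoding step before they could go into a geo_point field.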

I guess the simple way of putting it is that there isn't a really easy way to do this at the moment, unless you want to spend $$$ on an off-the-shelf solution or spend some time building something out of a base technology like OpenNLP.
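If you do go the build-it-yourself route, the crudest starting point is a regex pass over the Tika output in your watcher script before you index each document. A naive sketch (the pattern only catches plain decimal "lat, lon" pairs, will miss other formats such as DMS, and will produce false positives):

```python
import re

# Very rough pattern for a decimal "lat, lon" pair, e.g. "40.7128, -74.0060".
COORD_RE = re.compile(
    r"(-?(?:90|[0-8]?\d)(?:\.\d+)?)\s*,\s*(-?(?:180|1[0-7]\d|\d{1,2})(?:\.\d+)?)"
)

def extract_first_coordinate(text):
    """Return a 'lat,lon' string for the first coordinate-like pair, or None."""
    match = COORD_RE.search(text)
    if not match:
        return None
    lat, lon = match.groups()
    return f"{lat},{lon}"

# Example: copy the first match into the geo_point field before indexing.
doc = {"content": "Incident reported near 40.7128, -74.0060 at 09:00."}
location = extract_first_coordinate(doc["content"])
if location:
    doc["location"] = location
```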
