How to extract found terms into keyword fields

Mark_Harwood · April 25, 2019, 4:18pm

This sounds like an entity extraction problem and fortunately twitter handles and hashtags are easily identified using a simple regular expression.
I tend to use Python code to prepare docs but this is a personal choice and Logstash or ingest pipelines are other document-enrichment tools. This question explores the same problem.
Either way, you should be OK to have a plain doc with your original text field and use a keyword type structured field with an array of the extracted handles or tags. If you want to remember where these handles were extracted from the text it might be an idea to use an annotated_text field instead.

Topic		Replies	Views
How to extract terms into a new field while indexing Elasticsearch	6	876	May 1, 2019
Aggregations based on text fields instead of keyword fields Elasticsearch	9	1283	April 29, 2019
Extract Hashtags and Mentions into separate fields Elasticsearch	3	1043	January 13, 2022
Isolating Hashtags in Twitter Feed Logstash	8	583	April 23, 2018
Identifying Significant Words In a Field Kibana	8	642	May 1, 2018

How to extract found terms into keyword fields

Related topics