I am trying to create a tagging mechanism in Elasticsearch that checks for
a given text in a document and, if the text is found, tags the document
with the corresponding text_id.
E.g. tagging for company names: text = "Google Inc", text_id = "GOOG_123".
If a document contains the text "Google Inc", then tag it with the id
"GOOG_123".
A. Insertion
- Is there a way in Elasticsearch to achieve this at insertion time?
- If not, is there a way to tweak the Java code in Lucene to achieve
this?
B. Post-processing
- If it is not possible at insertion time, how can we best achieve
this with post-processing? By running a fuzzy or term search?
IMO this is document enrichment. You should do that on your side, before sending the document to Elasticsearch.
Alternatively, you can use the percolator to do some classification of your docs.
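
To make the "enrich on your side" suggestion concrete, here is a minimal sketch in Python of tagging a document before it is indexed. It is only an illustration under assumptions: the `TAGS` dictionary, the `body` field name, and the helper names `build_matcher` and `tag_document` are all hypothetical, and it does plain case-insensitive whole-phrase matching (no fuzzy matching). It also assumes the dictionary is not empty.

```python
import re

# Hypothetical dictionary mapping each phrase to its text_id.
TAGS = {
    "Google Inc": "GOOG_123",
}

def build_matcher(tags):
    """Compile one case-insensitive regex matching any known phrase as whole words."""
    # Longest phrases first, so a longer name wins over a shorter one it starts with.
    phrases = sorted(tags, key=len, reverse=True)
    alternatives = "|".join(re.escape(p) for p in phrases)
    return re.compile(r"(?<!\w)(?:" + alternatives + r")(?!\w)", re.IGNORECASE)

def tag_document(doc, tags, matcher, field="body"):
    """Return a copy of doc with a 'tags' list holding the text_ids found in doc[field]."""
    lookup = {phrase.lower(): text_id for phrase, text_id in tags.items()}
    text = doc.get(field, "")
    found = {lookup[m.group(0).lower()] for m in matcher.finditer(text)}
    enriched = dict(doc)  # leave the caller's document untouched
    enriched["tags"] = sorted(found)
    return enriched

matcher = build_matcher(TAGS)
doc = {"body": "Shares of Google Inc rose today."}
enriched = tag_document(doc, TAGS, matcher)
# 'enriched' is the document you would then send to Elasticsearch as usual.
```

The enriched document is indexed like any other, so the tags become an ordinary field you can filter or aggregate on. For a very large dictionary a single regex may get slow, in which case a dedicated multi-pattern matcher would probably be a better fit.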