Aggregation for similar strings

liorg2 · May 6, 2015, 3:11pm

hi guys,

i have similar string that i want to aggregate,
but they seem to be different within few characters at the end.

for example:
doc1: {"message":"hello world and good morning1 300"}
doc2: {"message":"hello world and good morning1 200"}
doc3: {"message":"hello world and good morning1 100"}

i would like to have this result in the aggregation:

"hello world and good morning1" - count: 3

the field currently defined with default analyzer

btw, is it possible to identify also more complected string such as:
doc3: {"message":"500 hello world and good morning1 100"}

thanks a lot for advanced!

Lior

--
Please update your bookmarks! We moved to https://discuss.elastic.co/

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/7c402360-6940-400e-8a2f-933107f410a1%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Topic		Replies	Views
Aggrigation with a whole string as key Elasticsearch	1	698	November 18, 2014
Aggregate similar documents Elasticsearch	0	376	February 5, 2021
Text similarity with Elasticseacrh Elasticsearch	4	471	October 13, 2020
Aggregation by similarity Elasticsearch	0	353	December 27, 2018
Slow simple aggregation related to a not analyzed string Elasticsearch	6	1322	November 24, 2016

Aggregation for similar strings

-- Please update your bookmarks! We moved to https://discuss.elastic.co/

Related topics

--
Please update your bookmarks! We moved to https://discuss.elastic.co/