Aggregation for similar strings


(liorg2) #1

Hi guys,

I have similar strings that I want to aggregate,
but they differ by a few characters at the end.

For example:
doc1: {"message":"hello world and good morning1 300"}
doc2: {"message":"hello world and good morning1 200"}
doc3: {"message":"hello world and good morning1 100"}

I would like to get this result from the aggregation:

"hello world and good morning1" - count: 3

The field is currently defined with the default analyzer.

By the way, is it also possible to identify more complicated strings, such as:
doc3: {"message":"500 hello world and good morning1 100"}
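
To make the desired grouping concrete, here is a minimal client-side sketch in Python. It is not an Elasticsearch aggregation; it only illustrates the intended result, under the assumption that the varying parts are purely numeric tokens at the start or end of the message. The names `normalize` and `count_similar` are hypothetical helpers made up for this illustration.

```python
import re
from collections import Counter

# Assumption: only whole numeric tokens at the edges vary between documents.
# Digits attached to a word (such as the "1" in "morning1") are left alone.
_EDGE_NUMBERS = re.compile(r"^(?:\d+\s+)+|(?:\s+\d+)+$")


def normalize(message: str) -> str:
    """Strip leading and trailing numeric tokens from a message."""
    return _EDGE_NUMBERS.sub("", message.strip())


def count_similar(messages):
    """Count messages after normalization (hypothetical helper)."""
    return Counter(normalize(m) for m in messages)


docs = [
    {"message": "hello world and good morning1 300"},
    {"message": "hello world and good morning1 200"},
    {"message": "hello world and good morning1 100"},
    {"message": "500 hello world and good morning1 100"},
]

counts = count_similar(d["message"] for d in docs)
for text, count in counts.items():
    print(f'"{text}" - count: {count}')
```

With the four example documents, all messages collapse into a single group. The same normalization could in principle be applied to the text before indexing, so that a regular aggregation on a non-analyzed copy of the field groups the documents together; that part is an assumption and is not shown here.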

Thanks a lot in advance!

Lior



(system) #2