Elasicsearch- Indexing documents based on change of sequence of characters

I have a set of documents that has to be indexed in the following way:

  1. fs$#56s as fs,$#,56,s
  2. FSand-hgfh as fsand,-,hgfh

Is there a way to do so? Thank you!

You need to create your own analyser for that, have a read of this chapter - https://www.elastic.co/guide/en/elasticsearch/reference/6.4/analysis.html

Thanks for the quick response.
I did go through the chapter. I feel pattern tokenizer can be used to achieve this but I am unable to provide a proper pattern to token my document text.
Is there any default pattern available?

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.