Adding a custom token filter in Elasticsearch for retrieving skip-grams

I searched online and found that the shingle token filter that ships
with Elasticsearch can produce bigrams, trigrams, etc.

I want to extract skip-grams from my documents at index time, along with
single words, bigrams, and trigrams.
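For context, here is a minimal sketch in Python of what I mean by skip-grams (k-skip-n-grams): n tokens kept in order, with up to k tokens allowed to be skipped in between. The function name and implementation are just for illustration, not from any Elasticsearch API:

```python
from itertools import combinations

def skip_grams(tokens, n, k):
    """Return all k-skip-n-grams of a token list.

    A k-skip-n-gram picks n tokens in their original order, allowing
    up to k tokens in total to be skipped between the chosen positions.
    """
    grams = []
    for start in range(len(tokens)):
        # choose the remaining n-1 positions from the next n-1+k tokens
        window = range(start + 1, min(len(tokens), start + n + k))
        for rest in combinations(window, n - 1):
            positions = (start,) + rest
            # total number of skipped tokens is the span length minus n
            if positions[-1] - positions[0] + 1 - n <= k:
                grams.append(" ".join(tokens[p] for p in positions))
    return grams

# 1-skip-bigrams of "the quick brown fox":
# ['the quick', 'the brown', 'quick brown', 'quick fox', 'brown fox']
```

With k=0 this reduces to ordinary n-grams, which is what the shingle filter already produces; the extra "skipped" combinations are what I am missing.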

Further searching suggested that I might have to write a custom plugin
for such a token filter, but I could not find proper documentation on
writing one. Can anyone point me to the right resources for this task?


You received this message because you are subscribed to the Google Groups "elasticsearch" group.