Force external tokenization/lemmatization

Hi,

Is there any way to force custom tokenization and lemmatization other than writing a custom token filter plugin?

I would like to synchronize the output of my own custom analysis (done first) with the data stored in Elasticsearch as a text field (done second). My aim is to use Elasticsearch for full-text search and highlighting.
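For context, here is a minimal sketch of the kind of setup I have in mind, with made-up index and field names: tokens are produced by my own analyzer outside Elasticsearch and then indexed into a field that uses the built-in whitespace analyzer, so Elasticsearch keeps my token boundaries instead of re-tokenizing.

```python
# Sketch only: external tokenization preserved by a whitespace-analyzed field.
# Index name, field name, and URL are placeholders.
import requests

ES = "http://localhost:9200"

# The whitespace analyzer splits only on the spaces I insert, so the
# externally produced token boundaries survive indexing unchanged.
requests.put(f"{ES}/products", json={
    "mappings": {
        "properties": {
            "description": {"type": "text", "analyzer": "whitespace"}
        }
    }
})

# Tokens from my external analyzer, re-joined with single spaces.
external_tokens = ["the", "quick", "brown", "foxes"]
requests.put(f"{ES}/products/_doc/1", json={
    "description": " ".join(external_tokens)
})
```

(I realize highlighting would then run over the space-joined token string rather than the original text, which is part of what I am trying to reconcile.)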

I've been playing with ingest nodes, but I can't see any way to force tokenization/lemmatization using processors. (I was hoping to somehow combine the two streams, tokens and lemmas, into a single searchable text field.) Is my impression correct?
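To make that parenthetical concrete, this is roughly the effect I was hoping to get, again with made-up names: the surface tokens and the externally produced lemmas indexed side by side, with both streams searched (and highlighted) together.

```python
# Sketch only: "two streams" emulated as sibling fields rather than as
# stacked tokens at the same positions. All names are hypothetical.
import requests

ES = "http://localhost:9200"

requests.put(f"{ES}/products-lemma", json={
    "mappings": {
        "properties": {
            "description":        {"type": "text", "analyzer": "whitespace"},
            "description_lemmas": {"type": "text", "analyzer": "whitespace"}
        }
    }
})

# One document: surface tokens and their lemmas from the external analyzer.
requests.put(f"{ES}/products-lemma/_doc/1", json={
    "description": "the foxes were running",
    "description_lemmas": "the fox be run"
})

# Query both streams at once; a match in either field scores the document.
r = requests.post(f"{ES}/products-lemma/_search", json={
    "query": {
        "multi_match": {
            "query": "fox run",
            "fields": ["description", "description_lemmas"]
        }
    },
    "highlight": {"fields": {"description": {}, "description_lemmas": {}}}
})
print(r.json())
```

What I can't see is how to get this merged into one field (tokens and lemmas at the same positions, the way a token filter would do it) without writing that filter as a plugin.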

Thanks in advance for any suggestions or tips,

ls
