I'm looking for a way to use a tokenizer (for example `path_hierarchy`) in an ingest pipeline. Is there any way of doing this?
Tokenizers are only applied when a document is indexed. The ingest pipeline modifies the JSON before indexing starts (this is also the reason why you can have dedicated ingest nodes, as these functions are split from each other).
I guess you could come up with a script processor that does some splitting based on a character and thus creates something similar to the path hierarchy tokenizer.
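A minimal sketch of what that script processor could look like, assuming the incoming documents have a `file_path` field; the pipeline name `path-hierarchy-sketch` and the target field `path_tokens` are made up for this example:

```json
PUT _ingest/pipeline/path-hierarchy-sketch
{
  "description": "Rough emulation of the path_hierarchy tokenizer",
  "processors": [
    {
      "script": {
        "lang": "painless",
        "source": "def tokens = []; def current = ''; for (def part : ctx.file_path.splitOnToken('/')) { if (part != '') { current += '/' + part; tokens.add(current); } } ctx.path_tokens = tokens;"
      }
    }
  ]
}
```

For a document with `"file_path": "/one/two/three"` this should store `["/one", "/one/two", "/one/two/three"]` in `path_tokens`, which mirrors what `path_hierarchy` emits with its default settings. You could test it against sample documents with the `_ingest/pipeline/path-hierarchy-sketch/_simulate` endpoint before wiring it up.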
Thanks! It might be nice to be able to leverage the functionality behind the tokenizers instead of rebuilding them with regexes.