Ingest processor vs token filter & char filter

Hi I want to ask about difference of token filter and ingest processor for lowercase and char filter html strip and ingest processor html strip, any example when use which.

Hey,

so this requires the explanation about the difference between ingest processors and analysis. The short variant is

  • An ingest processor is able modify the JSON document
  • An analysis step (character filter/token filter) is able to modify the data the gets stored in the inverted index

This means, if you retrieve the document via the Get API, the first will show modified JSON in the _source, where as the second, will show the document as you indexed it.

Hope this helps as a start.

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.