I use the standard tokenizer and I don't use the html_strip
char filter.
How can I index HTML tags?
In fact, I want to be able to search with and without the <
and >
characters. I.e. a search for <section>
should match This is about the <section> tag
, but it should not match In this section we talks about stuff
. The standard tokenizer will turn that (search) text to ["section"]
.
As a bonus, if this can be done I don't have to worry about the stop
char filter turning <a>
into []
.