I'm testing my app on elasticsearch-0.17.0-SNAPSHOT (built from
563ad625c0f69f3ff0f4c39f46421b1dc2c91b6f) and in my app I'm doing a
term facet on a field. I noticed a difference in behavior. If the
field contains "foo_bar", in 0.16 it would be tokenized as 2 tokens
["foo", "bar"], but in 0.17 it remains a single token ["foo_bar"]. I
have absolutely zero configuration change on my ES instance, it's a
complete vanilla install from the commit above. My mapping is created
dynamically without me specifying anything about it.
Hence my question: Did ES / Lucene start tokenizing fields differently?
Benoit "tsuna" Sigoure
Software Engineer @ www.StumbleUpon.com