Can you tune the split_on_case_change setting on the word delimiter token filter?

I have a love-hate relationship with split_on_case_change.
My problem is that only certain words make sense to split on case change. For example, I'd like TypedArray to be split into Typed and Array. But I don't want forEach split into for and each, because then for gets treated as the common stop word that it is.

I think it would be nice if I could configure split_on_case_change so that it only splits on case changes when the first character is [A-Z]. That is, split tokens that look like [A-Z][a-z]+[A-Z][a-z]+..., but leave tokens that look like [a-z]+[A-Z][a-z]+... alone.
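
To make the rule concrete, here is a rough Python sketch of the behavior I have in mind. It is not Elasticsearch configuration and not something the word delimiter filter does today; the function name split_if_pascal_case is made up for illustration, and it only handles plain ASCII letters.

```python
import re
from typing import List

# Assumption: a "splittable" token is two or more capitalized segments
# and nothing else, e.g. "TypedArray". ASCII letters only.
PASCAL_CASE = re.compile(r"(?:[A-Z][a-z]+){2,}")
SEGMENT = re.compile(r"[A-Z][a-z]+")


def split_if_pascal_case(token: str) -> List[str]:
    """Split on case change only when the token starts with an uppercase letter.

    Hypothetical helper: tokens such as "forEach" that start with a
    lowercase letter are returned unchanged.
    """
    if PASCAL_CASE.fullmatch(token):
        return SEGMENT.findall(token)
    return [token]


print(split_if_pascal_case("TypedArray"))
print(split_if_pascal_case("forEach"))
```

With this rule TypedArray comes out as Typed and Array, while forEach stays a single token, so for never reaches the stop word filter on its own.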
