ES 5.4.1
Config-1
GenerateWordParts = true, // [wi-fi] ---> [wi,fi]
GenerateNumberParts = true, // [12-03] ---> [12,03]
CatenateWords = false, // [wi-fi] -/-> [wifi]
CatenateNumbers = false, // [12-03] -/-> [1203]
CatenateAll = false, // [wi-fi-12] -/-> [wifi12]
SplitOnCaseChange = false, // [WiFi] -/-> [Wi,Fi]
PreserveOriginal = false, // [wi-fi] -/-> [wi-fi,wi,fi]
SplitOnNumerics = false, // [j2ee] -/-> [j,2,ee]
StemEnglishPossessive = true // [Jack's] ---> [Jack]
Config-2
GenerateWordParts = true, // [wi-fi] ---> [wi,fi]
GenerateNumberParts = true, // [12-03] ---> [12,03]
CatenateWords = false, // [wi-fi] -/-> [wifi]
CatenateNumbers = false, // [12-03] -/-> [1203]
CatenateAll = false, // [wi-fi-12] -/-> [wifi12]
SplitOnCaseChange = true, // [WiFi] -/-> [Wi,Fi]
PreserveOriginal = true, // [wi-fi] -/-> [wi-fi,wi,fi]
SplitOnNumerics = true, // [j2ee] -/-> [j,2,ee]
StemEnglishPossessive = true // [Jack's] ---> [Jack]
I am using these two configs of wordDelimiterFilter on same analyzer this starts giving error "startOffset must be non-negative, and endOffset must be >= startOffset, and offsets must not go backwards" on indexing text like "AtulBagga.TestConfig". Is this an issue with wordDelimiterTokenFilter on lucene?
Requirement: I want the tokens in such a way that "AtulBagga24.TestConfig" is searchable with all of the following keywords-
atul, bagga, atulbagga, atulbagga24, atulbagga24.textConfig, test, config, testconfig, 24
Is there a way to solve this if above approach has known issues?