I have configured an analyzer with the standard tokenizer and the lowercase
filter, and search works well for cases such as words in the middle of a
sentence and key-value pairs.
Index settings:

    index:
      analysis:
        analyzer:
          # set standard analyzer with no stop words as the default for both
          # indexing and searching
          default_index:
            type: custom
            tokenizer: standard
            filter: [standard, lowercase]

My concern is that when I search for A=B, I get results for A, for B, and for
A=B, all three of them. If I switch to the whitespace tokenizer, that solves
this problem, but then I am no longer able to search for words within a
sentence.
How can I prevent A=B from being tokenized, so that it is retained as is,
while keeping the rest of the index settings?
You can use the pattern tokenizer. If you configure it with the same
delimiters as the standard tokenizer, with the exception of '=', your
queries should work.
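As a rough sketch of that suggestion, the settings below swap the standard
tokenizer for a pattern tokenizer whose regex treats any run of characters
that are neither word characters nor '=' as a delimiter. The tokenizer name
keep_equals and the regex itself are assumptions made for illustration, not
something taken from this thread. The regex only approximates the standard
tokenizer, which follows Unicode word-boundary rules rather than a single
pattern, so tokens containing apostrophes, dots, or non-ASCII letters may come
out differently.

```yaml
index:
  analysis:
    tokenizer:
      # hypothetical name; any identifier works
      keep_equals:
        type: pattern
        # the pattern matches the separators, not the tokens:
        # split on runs of anything that is not a word character or '='
        pattern: '[^\w=]+'
    analyzer:
      default_index:
        type: custom
        tokenizer: keep_equals
        filter: [standard, lowercase]
```

With a pattern like this, "Foo A=B, bar" should yield the tokens foo, a=b and
bar after lowercasing, while "A = B" (with spaces) would still yield a, = and
b. The query text has to go through the same tokenizer for A=B to stay whole
at search time, so whichever analyzer is applied to searches would need the
same change.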
Can you give an example of a regex that we can use?
On Monday, 16 December 2013 22:46:30 UTC+5:30, Ivan Brusic wrote:
You can use the pattern tokenizer. If you configure it with the same
delimiters as the standard tokenizer, with the exception of '=', your
queries should work.
--
Ivan
On Sun, Dec 15, 2013 at 1:30 AM, deep saxena <sandy1...@gmail.com> wrote: