Possible issue with ngrams and highlighting? Gist included. (Was: Which is the best (right) use of NGrams?)

Hi Alex,

I created a gist with my experiments:

The last technique (using query_string with whitespace analyser) does not
seem to work consistently. Is this a bug or I am doing it wrong? Shall I
open a ticket for this?

Regards,
Lukas

On Thu, Feb 21, 2013 at 5:49 AM, AlexR roytmana@gmail.com wrote:

Hi Lukas,

It will be very interesting to compare notes. I will be out of town for
few days and may not be able to conclude my test so lets touch base next
week if it's ok with you

Alex

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

On Thu, 2013-02-21 at 16:32 +0100, Lukáš Vlček wrote:

Hi Alex,

I created a gist with my experiments:
https://gist.github.com/lukas-vlcek/5005428

The last technique (using query_string with whitespace analyser) does
not seem to work consistently. Is this a bug or I am doing it wrong?
Shall I open a ticket for this?

It seems to be related to your use of the keyword tokenizer.

When using the standard tokenizer, it works:

clint

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

This seems to be working better!
Thanks Clint.

On Thu, Feb 21, 2013 at 4:56 PM, Clinton Gormley clint@traveljury.comwrote:

On Thu, 2013-02-21 at 16:32 +0100, Lukáš Vlček wrote:

Hi Alex,

I created a gist with my experiments:
https://gist.github.com/lukas-vlcek/5005428

The last technique (using query_string with whitespace analyser) does
not seem to work consistently. Is this a bug or I am doing it wrong?
Shall I open a ticket for this?

It seems to be related to your use of the keyword tokenizer.

When using the standard tokenizer, it works:

https://gist.github.com/clintongormley/5005666

clint

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.