I guess the word "max" in "max_shingle_size" should have given me a
clue on how it operates! The example confused me since it did not list
the shingles with one term.
Need to play around with it, I might have a use for it as well.
On Thu, Sep 8, 2011 at 8:11 PM, Matt Weber email@example.com wrote:
Not sure what you mean by same size. If you have "a b c" and "a b d c" and
set a max_shingle_size = 4 and output_unigrams = true, we should get these
2 - a
2 - b
2 - c
2 - a b
1 - d
1 - b c
1 - b d
1 - d c
1 - a b c
1 - a b d
1 - b d c
1 - a b d c
On Thu, Sep 8, 2011 at 3:09 PM, Ivan Brusic firstname.lastname@example.org wrote:
Wouldn't the shingles need to be the same size?
On Wed, Sep 7, 2011 at 12:31 PM, Matt Weber email@example.com wrote:
Yea, tokenize the field into shingles:
On Wed, Sep 7, 2011 at 8:32 AM, Pulkit Agrawal firstname.lastname@example.org
I am looking for facet on phrases.
e.g: suppose I have field text with value "abc xyz def" and "abc xyz
Now I need some help to get facet count like "abc xyz"- count:"2" ,
"abc"-count:"2", "xyz"- count:"2", "def" - count:"2".
Is it possible with elasticsearch? I think elasticsearch is a awesome
product there must be something to get this.
Thanks in advance.