Whitespace analyzer

I am using whitespace analyzer and I have a text of format "p1->p2->p3". My
assumption is that when using whitespace analyzer this text will not be
broken down into terms p1,p2 and p3. Is that correct?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Hey,

you are right, as a whitespace analyzer splits by whitespace, which is not
included here. It is easy for you to verify (you can try the standard
analyzer to get a different behaviour):

curl -X POST 'localhost:9200/_analyze?analyzer=whitespace&pretty' -d
'p1->p2->p3'

{
"tokens" : [ {
"token" : "p1->p2->p3",
"start_offset" : 0,
"end_offset" : 10,
"type" : "word",
"position" : 1
} ]
}

As you can se with the analyze API, your input remains one token..

--Alex

On Tue, May 7, 2013 at 8:33 PM, Mohit Anchlia mohitanchlia@gmail.comwrote:

I am using whitespace analyzer and I have a text of format "p1->p2->p3".
My assumption is that when using whitespace analyzer this text will not be
broken down into terms p1,p2 and p3. Is that correct?

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Thanks! Is it possible to run _analyze on top of existing index? I had an
index with standard tokenizer which I converted to whitespace but it
doesn't seem to be working as expected:

curl -XPOST 'localhost:9200/pflow1/_close'

curl -XPUT 'localhost:9200/pflow1/_settings' -d '{
"analysis" : {
"analyzer":{
"content":{
"type":"whitespace",
"tokenizer":"whitespace"
}
}
}
}'

curl -XPOST 'localhost:9200/pflow1/_open'

On Tue, May 7, 2013 at 11:37 AM, Alexander Reelsen alr@spinscale.de wrote:

Hey,

you are right, as a whitespace analyzer splits by whitespace, which is not
included here. It is easy for you to verify (you can try the standard
analyzer to get a different behaviour):

curl -X POST 'localhost:9200/_analyze?analyzer=whitespace&pretty' -d
'p1->p2->p3'

{
"tokens" : [ {
"token" : "p1->p2->p3",
"start_offset" : 0,
"end_offset" : 10,
"type" : "word",
"position" : 1
} ]
}

As you can se with the analyze API, your input remains one token..

--Alex

On Tue, May 7, 2013 at 8:33 PM, Mohit Anchlia mohitanchlia@gmail.comwrote:

I am using whitespace analyzer and I have a text of format "p1->p2->p3".
My assumption is that when using whitespace analyzer this text will not be
broken down into terms p1,p2 and p3. Is that correct?

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Hey,

you can use any analyzer you want by specifying it, see

--Alex

On Tue, May 7, 2013 at 8:41 PM, Mohit Anchlia mohitanchlia@gmail.comwrote:

Thanks! Is it possible to run _analyze on top of existing index? I had an
index with standard tokenizer which I converted to whitespace but it
doesn't seem to be working as expected:

curl -XPOST 'localhost:9200/pflow1/_close'

curl -XPUT 'localhost:9200/pflow1/_settings' -d '{
"analysis" : {
"analyzer":{
"content":{
"type":"whitespace",
"tokenizer":"whitespace"
}
}
}
}'

curl -XPOST 'localhost:9200/pflow1/_open'

On Tue, May 7, 2013 at 11:37 AM, Alexander Reelsen alr@spinscale.dewrote:

Hey,

you are right, as a whitespace analyzer splits by whitespace, which is
not included here. It is easy for you to verify (you can try the standard
analyzer to get a different behaviour):

curl -X POST 'localhost:9200/_analyze?analyzer=whitespace&pretty' -d
'p1->p2->p3'

{
"tokens" : [ {
"token" : "p1->p2->p3",
"start_offset" : 0,
"end_offset" : 10,
"type" : "word",
"position" : 1
} ]
}

As you can se with the analyze API, your input remains one token..

--Alex

On Tue, May 7, 2013 at 8:33 PM, Mohit Anchlia mohitanchlia@gmail.comwrote:

I am using whitespace analyzer and I have a text of format "p1->p2->p3".
My assumption is that when using whitespace analyzer this text will not be
broken down into terms p1,p2 and p3. Is that correct?

--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.