Can anyone help me understand what is going on here?
I checked how the data is being analyzed by running:
tiklup-mac:~ pulkitsinghal$ curl -XGET 'http://localhost:9200/bbyopen/_analyze?pretty=true' -d 'PULKIT SINGHAL'
As expected, the text is lowercased and split into two terms, "pulkit" and
"singhal":
{
  "tokens" : [ {
    "token" : "pulkit",
    "start_offset" : 0,
    "end_offset" : 6,
    "type" : "",
    "position" : 1
  }, {
    "token" : "singhal",
    "start_offset" : 7,
    "end_offset" : 14,
    "type" : "",
    "position" : 2
  } ]
}
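
To make the offsets and positions in that output easier to read, here is a rough Python sketch of what a lowercasing word analyzer does to the input. This is only an approximation under my own assumptions, not Elasticsearch's actual analyzer: the function name `analyze` and the regex-based splitting are made up for illustration, and the "type" field is left out.

```python
import re


def analyze(text):
    """Approximate a lowercasing word analyzer (illustrative only).

    Splits on runs of non-alphanumeric characters, lowercases each term,
    and records character offsets plus a 1-based position, mirroring the
    fields shown in the _analyze response above.
    """
    tokens = []
    for position, match in enumerate(re.finditer(r"[A-Za-z0-9]+", text), start=1):
        tokens.append({
            "token": match.group().lower(),   # term as it would be indexed
            "start_offset": match.start(),    # index of first character
            "end_offset": match.end(),        # index one past the last character
            "position": position,             # ordinal of the term in the stream
        })
    return tokens


if __name__ == "__main__":
    for token in analyze("PULKIT SINGHAL"):
        print(token)
```

The point is that the index holds the lowercased terms "pulkit" and "singhal" separately, so the original string "PULKIT SINGHAL" never exists as a single term.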