Not able to see whether data is stemmed or not!

I set up an index using these settings:

{
  "settings": {
    "index": {
      "type": "default"
    },
    "number_of_shards": 1,
    "number_of_replicas": 0,
    "analysis": {
      "filter": {
        "stopper": {
          "type": "stop",
          "stopwords": "english"
        },
        "stemmer_light": {
          "type": "stemmer",
          "name": "light_english"
        },
        "stemmer_possessive": {
          "type": "stemmer",
          "name": "possessive_english"
        }
      },
      "analyzer": {
        "new_analyzer": {
          "type": "custom",
          "tokenizer": "whitespace",
          "filter": [
            "stopper",
            "lowercase",
            "whitespace",
            "stemmer_light",
            "stemmer_possessive"
          ]
        }
      }
    }
  }
}

After I am done with indexing I run these:
GET /index1/document/1101/_source

GET /index1/_mtermvectors/
{
  "docs": [
    {
      "_type": "news",
      "_id": "1101",
      "fields": ["text"],
      "term_statistics": true
    }
  ]
}

None of these show me stemmed words, and all of them still contain the stop
words! What am I doing wrong?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/f5d6a6df-ca79-427a-9b1a-74fca04e8b40%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

I can't see any mapping set here. Could you post a full example as a gist?

Maybe you just forgot to use the analyzer you defined?

David
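
One way to check whether the analyzer does what is expected, independent of any mapping, is the `_analyze` API, which returns the tokens an analyzer produces for a piece of text. Note that `_source` returns the document exactly as it was submitted, so it never shows stemming or stop-word removal. The sketch below only builds the request and sends nothing; the host and the sample sentence are placeholders, not taken from the thread. It uses the JSON-body form of the request. As far as I recall, the 1.x releases current at the time of this thread also accepted `analyzer` and `text` as query-string parameters, but that is worth checking against the documentation for your version.

```python
import json
from urllib.parse import quote


def build_analyze_request(index, analyzer, text, host="http://localhost:9200"):
    """Return (url, body) for an _analyze call against a named analyzer.

    host is a placeholder; this function does not contact any server.
    """
    url = f"{host}/{quote(index)}/_analyze"
    body = json.dumps({"analyzer": analyzer, "text": text})
    return url, body


# Index and analyzer names come from the question; the sentence is made up.
url, body = build_analyze_request("index1", "new_analyzer", "The foxes are jumping")
print(url)
print(body)
```

If the tokens returned for such a request still contain stop words or unstemmed forms, the problem is in the analyzer definition; if they look right, the problem is more likely in how the field is mapped or queried.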

On 26 Jan 2015 at 04:11, Panzer appythoncheck@gmail.com wrote:

None of these show me stemmed words, and all of them still contain the stop words! What am I doing wrong?


"document": {
  "properties": {
    "text": {
      "type": "string",
      "store": true,
      "index": "analyzed",
      "term_vector": "with_positions_offsets_payloads",
      "filter": "stopper",
      "analyzer": "new_analyzer"
    },
    "doc_length": {
      "type": "long",
      "store": true,
      "index": "not_analyzed"
    }
  }
}

This was the mapping I was using.
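
One thing that stands out when this mapping is read next to the earlier requests: the mapping is defined for the type `document`, while the `_mtermvectors` request asks for `_type: "news"`. Whether that is the cause here is not confirmed in the thread. The sketch below is a small, hypothetical consistency check (the function name and message wording are my own) that compares the type being queried and the field's analyzer against what is actually defined.

```python
def find_mismatches(analyzers, mappings, requested_type, field):
    """List mismatches between defined analyzers, mapped types and a query."""
    problems = []
    if requested_type not in mappings:
        problems.append(
            f"type '{requested_type}' is not mapped; mapped types: {sorted(mappings)}"
        )
    for type_name, mapping in mappings.items():
        field_def = mapping.get("properties", {}).get(field)
        if field_def is None:
            problems.append(f"type '{type_name}' has no field '{field}'")
            continue
        analyzer = field_def.get("analyzer")
        if analyzer is None:
            problems.append(f"{type_name}.{field} names no analyzer")
        elif analyzer not in analyzers:
            # Built-in analyzers are not listed in the index settings,
            # so this is a hint rather than a definite error.
            problems.append(
                f"{type_name}.{field} uses '{analyzer}', not defined in settings"
            )
    return problems


# Values copied from the thread: one custom analyzer, one mapped type,
# and the type named in the _mtermvectors request.
analyzers = {"new_analyzer"}
mappings = {
    "document": {
        "properties": {"text": {"type": "string", "analyzer": "new_analyzer"}}
    }
}
problems = find_mismatches(analyzers, mappings, requested_type="news", field="text")
for problem in problems:
    print(problem)
```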

On Sunday, January 25, 2015 at 11:42:14 PM UTC-5, David Pilato wrote:

I can't see any mapping set here. Could you post a full example as a gist?

Maybe you just forgot to use the analyzer you defined?

David
