Not able to see whether data stemmed or not!

Panzer · January 26, 2015, 3:11am

I set up an index using those settings :

{
"settings": {
"index": {
"type": "default"
},
"number_of_shards": 1,
"number_of_replicas": 0,
"analysis": {
"filter": {
"stopper": {
"type": "stop",
"stopwords": "english"
},
"stemmer_light": {
"type": "stemmer",
"name": "light_english"
},
"stemmer_possessive": {
"type": "stemmer",
"name": "possessive_english"
}
},
"analyzer": {
"new_analyzer" : {
"type": "custom",
"tokenizer": "whitespace",
"filter": ["stopper",
"lowercase",
"whitespace",
"stemmer_light",
"stemmer_possessive"]
}
}
}
}
}

After I am done with indexing I run these:
GET /index1/document/1101/_source

GET /index1/_mtermvectors/
{
"docs": [
{
"_type": "news",
"_id": "1101",
"fields": ["text"],
"term_statistics": true
}
]
}

None of these show me stemmed words and moreover all still have the stop
words in them!!! What am I doing wrong?

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/f5d6a6df-ca79-427a-9b1a-74fca04e8b40%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

dadoonet · January 26, 2015, 4:42am

I can't see any mapping set here. Could you GIST a full example?

May be you just forgot to use the analyzer you defined?

David

Le 26 janv. 2015 à 04:11, Panzer appythoncheck@gmail.com a écrit :

I set up an index using those settings :

{
"settings": {
"index": {
"type": "default"
},
"number_of_shards": 1,
"number_of_replicas": 0,
"analysis": {
"filter": {
"stopper": {
"type": "stop",
"stopwords": "english"
},
"stemmer_light": {
"type": "stemmer",
"name": "light_english"
},
"stemmer_possessive": {
"type": "stemmer",
"name": "possessive_english"
}
},
"analyzer": {
"new_analyzer" : {
"type": "custom",
"tokenizer": "whitespace",
"filter": ["stopper",
"lowercase",
"whitespace",
"stemmer_light",
"stemmer_possessive"]
}
}
}
}
}

After I am done with indexing I run these:
GET /index1/document/1101/_source

GET /index1/_mtermvectors/
{
"docs": [
{
"_type": "news",
"_id": "1101",
"fields": ["text"],
"term_statistics": true
}
]
}

None of these show me stemmed words and moreover all still have the stop words in them!!! What am I doing wrong?

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/f5d6a6df-ca79-427a-9b1a-74fca04e8b40%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/3E9F9B05-2114-4FF2-9A1F-4725F49AB566%40pilato.fr.
For more options, visit https://groups.google.com/d/optout.

Panzer · January 26, 2015, 12:40pm

"document": {
"properties": {
"text": {
"type": "string",
"store": true,
"index": "analyzed",
"term_vector": "with_positions_offsets_payloads",
"filter": "stopper",
"analyzer": "new_analyzer"
},
"doc_length": {
"type": "long",
"store": true,
"index": "not_analyzed"
}
}
}

This was the mapping I was using.

On Sunday, January 25, 2015 at 11:42:14 PM UTC-5, David Pilato wrote:

I can't see any mapping set here. Could you GIST a full example?

May be you just forgot to use the analyzer you defined?

David

Le 26 janv. 2015 à 04:11, Panzer <appyth...@gmail.com <javascript:>> a
écrit :

I set up an index using those settings :

{
"settings": {
"index": {
"type": "default"
},
"number_of_shards": 1,
"number_of_replicas": 0,
"analysis": {
"filter": {
"stopper": {
"type": "stop",
"stopwords": "english"
},
"stemmer_light": {
"type": "stemmer",
"name": "light_english"
},
"stemmer_possessive": {
"type": "stemmer",
"name": "possessive_english"
}
},
"analyzer": {
"new_analyzer" : {
"type": "custom",
"tokenizer": "whitespace",
"filter": ["stopper",
"lowercase",
"whitespace",
"stemmer_light",
"stemmer_possessive"]
}
}
}
}
}

After I am done with indexing I run these:
GET /index1/document/1101/_source

GET /index1/_mtermvectors/
{
"docs": [
{
"_type": "news",
"_id": "1101",
"fields": ["text"],
"term_statistics": true
}
]
}

None of these show me stemmed words and moreover all still have the stop
words in them!!! What am I doing wrong?

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearc...@googlegroups.com <javascript:>.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/f5d6a6df-ca79-427a-9b1a-74fca04e8b40%40googlegroups.com
https://groups.google.com/d/msgid/elasticsearch/f5d6a6df-ca79-427a-9b1a-74fca04e8b40%40googlegroups.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/d202e963-fe2a-4d8f-a699-a80e610fa20f%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

Topic		Replies	Views
Elasticsearch Analyzer:Stemmer giving different results Elasticsearch	1	376	February 6, 2019
How to stem, remove stop words from my index data? Elasticsearch	4	2197	July 4, 2019
Elastic and language stem (polish) Elasticsearch	2	1578	July 6, 2017
Stemmer not working [ES 6.7.1] Elasticsearch	2	490	May 7, 2019
Elasticsearch stemmer issue Elasticsearch	1	313	July 6, 2017

Not able to see whether data stemmed or not!

None of these show me stemmed words and moreover all still have the stop words in them!!! What am I doing wrong?

Related topics