Please let me know what I'm doing wrong or where to look/debug.
- I git cloned https://github.com/asyncee/elasticsearch-russian-config/
- Downloaded elasticsearch-1.4.2 and copied bin and lib into the same dir.
- Installed https://github.com/imotov/elasticsearch-analysis-morphology 1.2
- Ran it.
Now it looks like I have the Russian analyzer; at least this test gives the correct
tokens:
curl -XGET "http://localhost:9200/_analyze?analyzer=russian&text=Веселые%20истории%20про%20котят"
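The URL above mixes raw Cyrillic with a hand-encoded %20, which some shells and HTTP clients handle differently. A minimal sketch of building the same request URL with every parameter percent-encoded, assuming Python 3 is available; the helper name build_analyze_url and the ES_HOST constant are hypothetical and not part of Elasticsearch:

```python
from urllib.parse import quote, urlencode

# Assumption: Elasticsearch listens on the default local address.
ES_HOST = "http://localhost:9200"


def build_analyze_url(text, analyzer=None, index=None, host=ES_HOST):
    """Return an _analyze URL with every query parameter percent-encoded."""
    # With an index name, the request is resolved against that index's analysis settings.
    path = "/{}/_analyze".format(index) if index else "/_analyze"
    params = {"text": text}
    if analyzer:
        params["analyzer"] = analyzer
    # quote_via=quote encodes spaces as %20 and non-ASCII text as UTF-8 bytes.
    return host + path + "?" + urlencode(params, quote_via=quote)


# Same request as the curl call above, fully encoded.
print(build_analyze_url("Веселые истории про котят", analyzer="russian"))
```

Passing index="blog2" without an analyzer builds the URL that asks a specific index to analyze the text with its own default analyzer, which may be a more direct check than the cluster-level call.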
...But then I create an index:
curl -XPOST "http://localhost:9200/blog2" -d'
{
  "settings": {
    "analysis": {
      "filter": {
        "ru_stop": {
          "type": "stop",
          "stopwords": "russian"
        },
        "ru_stemmer": {
          "type": "stemmer",
          "language": "russian"
        }
      },
      "analyzer": {
        "default": {
          "char_filter": [
            "html_strip"
          ],
          "tokenizer": "standard",
          "filter": [
            "lowercase",
            "ru_stop",
            "ru_stemmer"
          ]
        }
      }
    }
  },
  "mappings": {
    "post": {
      "properties": {
        "content": {
          "type": "string"
        },
        "published_at": {
          "type": "date",
          "format": "dateOptionalTime"
        },
        "tags": {
          "type": "string",
          "index": "not_analyzed"
        },
        "title": {
          "type": "string"
        }
      }
    }
  }
}'
And insert the record:
curl -XPUT "http://localhost:9200/blog2/post/2" -d'
{
  "title": "Веселые щенки",
  "content": "Смешная история про щенков",
  "tags": [
    "щенки",
    "смешная история"
  ],
  "published_at": "2014-08-12T20:44:42+00:00"
}'
Now I can find it with:
-> POST http://localhost:9200/blog2/post/_search
{
  "query": {
    "match": {
      "title": "щенки"
    }
  }
}
But not if I search for the singular "щенок" instead of the plural "щенки".
So, basically, the morphology doesn't seem to be applied to this index.
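One possible way to narrow this down, sketched under assumptions: a match query can only find the document if the analyzed query text shares at least one token with the analyzed field text. Note that the default analyzer defined for blog2 uses the built-in stemmer filter, so it may produce different tokens than the analyzer tested earlier. Running both word forms through GET /blog2/_analyze and comparing the results would show whether they ever meet. The helper names below are hypothetical, and the two response bodies are placeholders that only mimic the shape of an _analyze response; they are not real Elasticsearch output:

```python
import json


def tokens_from_analyze(response_body):
    """Extract the token strings from the JSON body of an _analyze response."""
    return [entry["token"] for entry in json.loads(response_body)["tokens"]]


def common_tokens(indexed_body, query_body):
    """Return the tokens produced for both the indexed text and the query text."""
    return set(tokens_from_analyze(indexed_body)) & set(tokens_from_analyze(query_body))


# Placeholder bodies (NOT real output): paste the actual responses for
# the indexed form and the searched form here.
indexed = '{"tokens": [{"token": "form_a", "position": 1}]}'
query = '{"tokens": [{"token": "form_b", "position": 1}]}'

# An empty result means the two forms share no token, so a match query on one
# form cannot find a document that was indexed with the other.
print(common_tokens(indexed, query))
```

If the real responses for "щенки" and "щенок" have no token in common, the problem is in the index's analysis chain and not in the query.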
P.S. I'm kind of new to elasticsearch.