I performed an upgrade yesterday for our ElasticSearch and Kibana instances from 5.6 to 6.8 in ElasticCloud.
We had some recordswith non-standard characters due to a previous character encoding issue when importing data using logstash.
I notice that since the upgrade to 6.8 the following query for one of these existing records will no longer returns any data:
GET my_search_alias/_search
{
"query": {
"match_phrase": {
"ARTICLE_TITLE": "©"
}
}
}
This record is present and searching for another field returns the expected value:
GET my_search_alias/_search
{
"query": {
"term": {
"MY_ID": 1234
}
}
}
...
"MY_ID" : 1234
"ARTICLE_TITLE": "MarkSE016TEST CAR1Deutscher Titel Wort1 Wort2 Wort3 Wort4 Wort5 320 © Mark Evans"
...
However, on the 5.6 version of ElasticSearch a record is retrieved when searching using the non standard character string:
GET my_search_alias/_search
{
"query": {
"match_phrase": {
"ARTICLE_TITLE": "©"
}
}
}
...
"MY_ID" : 1234
"ARTICLE_TITLE": "MarkSE016TEST CAR1Deutscher Titel Wort1 Wort2 Wort3 Wort4 Wort5 320 © Mark Evans"
...
If I create a NEW record in the 6.8 version with a copy and paste of the previous title then the new record IS retrieved by the character search query BUT the record that I copied and pasted the title from is not returned.
- Is there any difference in character searching post upgrade from 5.6 to 6.8?
- Is there a way to recompile the index or find out why the old record is not being returned since the upgrade - Even through when I create a new record with exactly the same value this is returned?
Many thanks,
Mark