Hello,
I've a problem on highlighting. When I search for the keyword "éolienne", Elastic highlights l' instead of the l'éolienne. (see picture problem_highlight.png)
The complete sentene is :
"""Daulitz, Domaine de Larroque Vieil, Ferme de Clèche, impasse As Prats, impasse de la Forge, impasse de l’Eolienne, impasse du Merle, impasse du Pinson, impasse Flourac, impasse Loubine, impasse Lucie Aubrac, impasse Pierre Dupont, impasse Redorthe, impasse Roundy"""
And I got this in highlight :
"""Daulitz, Domaine de Larroque Vieil, Ferme de Clèche, impasse As Prats,
impasse de la Forge, impasse de l’"""
My query is :
{
"highlight": {
"boundary_scanner": "sentence",
"boundary_scanner_locale":"fr-FR",
"fields": {
"*": {}
}
},
"query": {
"bool": {
"boost": 1,
"filter": ,
"must": [
{
"multi_match": {
"fields": "texte_extrait.raw",
"query": "éolienne",
"type": "phrase"
}
}
]
}
}
}
In the same index, I've correct highlights in other documents. (see picture highlight_ok.png)
I think there is a problem with sentence splitting. Do you have a solution to resolve this problem ?
Thanks a lot.
Minwei DENG