please share your full setup, as it's not clear here what the englishtoken analyzer consists off in terms of tokenizer and token filters. This will prevent guessing on our side
This is the full analyzer for enarticles!
$params = [
'index' => 'enarticles',
'body' => [
'settings' => [
'number_of_shards' => 3,
'number_of_replicas' => 2,
Just quickly chiming in that what you're running into is the challenge of how stemming treats plurals. The term courses, as you've demonstrated, emits the token cours.
But then, the token that's emitted for the term cours is actually cour, as the stemmer sees the term as a plural rather than as a the stem of course.
I'm not certain what the prescribed approach is to work around this; search fuzzification occurs to me, but applied systematically to your queries, that could increase the number of false positive results you retrieve.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.