One of the major pain point with hybrid-search is how to combine BM25 (lexical) score with vector similarity score. BM25 scores are unbounded and their absolute value does not indicate degree of relevance. While on the other hand, vector (cosine) similarity score is bounded between 0 and 1, and its absolute value indicates degree of relevance.
Due to difference in scale of their scores, it is difficult to combine them.
Opensearch provides score normalization feature called, Normalization processor. Is there something similar in elasticsearch?
PS: RRF is one of the way to combine BM25 and vector similarity score. But currently this feature is in technical preview and also RRF is not be suitable for all the use-cases.