More like this once again

Hi.

I’m using MLT. Trying to change percent_terms_to_match parameter, but
nothing changes.

Values 0.3, 0.5, 1

returns same results including only 1 term match between docs.

Is it correct?

Sample of my query:

/_mlt?mlt_fields=positive,negative&min_doc_freq=1&max_query_terms=1000&analyzer=whitespace&percent_terms_to_match=0.5

H Yark

I’m using MLT. Trying to change percent_terms_to_match parameter, but
nothing changes.
Values 0.3, 0.5, 1
returns same results including only 1 term match between docs.

Is it correct?

It is impossible to say without knowing your data.

I suggest you play with min_term_freq, min_doc_freq - you're probably
testing this on a small number of docs.

I can tell you that we're using mlt with great success in production

clint

My data just set of md5 strings divided by space. This md5 hashes of my
specific objects. There 30 to 500 different hashes (terms?) in one doc with
total of 160 to 2500 tems (one hash may be repeated from 1 to 10 times)

I need to find docs, where more than only 1 or 2 terms (md5 hashes) is
matches.