Question about the ranking in ElasticSearch

maximilian_brodhun · August 1, 2013, 1:23pm

Hallo dear all,

I have ingested a few files (more than 700.000) in an ElasticSearch Index.

The files in the index have to attachement fields where I stored fulltext
data. This field is sometimes really huge.

As I queried my index I missed one special file.

In the attachement filed "ftattach" this document has a text which has over
800 A4 pages. The word I searched for appears just one single time in the
whole text.

Maybe ElasticSearch (or Lucene) calculates that as not relevant because of
a low score.

Is there any posibility to decrement the necessary ranking-score so I can
find the missing file?

Hope you can help me.

kind regards

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

jprante · August 1, 2013, 1:39pm

Maybe it helps to index each page into a document.

Jörg

On Thu, Aug 1, 2013 at 3:23 PM, maximilian.brodhun@googlemail.com wrote:

Hallo dear all,

I have ingested a few files (more than 700.000) in an Elasticsearch Index.

The files in the index have to attachement fields where I stored fulltext
data. This field is sometimes really huge.

As I queried my index I missed one special file.

In the attachement filed "ftattach" this document has a text which has
over 800 A4 pages. The word I searched for appears just one single time in
the whole text.

Maybe Elasticsearch (or Lucene) calculates that as not relevant because of
a low score.

Is there any posibility to decrement the necessary ranking-score so I can
find the missing file?

Hope you can help me.

kind regards

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

maximilian_brodhun · August 2, 2013, 9:34am

The document is a XML file which I can't divide. For other things it is
necessary that document one huge document.

There is no possibility to just decrement ranking requirements so
Elasticsearch list also results with a lower score?

Am Donnerstag, 1. August 2013 15:39:41 UTC+2 schrieb Jörg Prante:

Maybe it helps to index each page into a document.

Jörg

On Thu, Aug 1, 2013 at 3:23 PM, <maximilia...@googlemail.com <javascript:>

wrote:

Hallo dear all,

I have ingested a few files (more than 700.000) in an Elasticsearch
Index.

The files in the index have to attachement fields where I stored fulltext
data. This field is sometimes really huge.

As I queried my index I missed one special file.

In the attachement filed "ftattach" this document has a text which has
over 800 A4 pages. The word I searched for appears just one single time in
the whole text.

Maybe Elasticsearch (or Lucene) calculates that as not relevant because
of a low score.

Is there any posibility to decrement the necessary ranking-score so I can
find the missing file?

Hope you can help me.

kind regards

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearc...@googlegroups.com <javascript:>.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Ivan · August 3, 2013, 1:26pm

Search is returning the document, but just not scored as high as you would
like it? If so, field length might be a problem. You can disable norms on
that field so that length normalization is not occurring.

--
Ivan

On Fri, Aug 2, 2013 at 5:34 AM, maximilian.brodhun@googlemail.com wrote:

The document is a XML file which I can't divide. For other things it is
necessary that document one huge document.

There is no possibility to just decrement ranking requirements so
Elasticsearch list also results with a lower score?

Am Donnerstag, 1. August 2013 15:39:41 UTC+2 schrieb Jörg Prante:

Maybe it helps to index each page into a document.

Jörg

On Thu, Aug 1, 2013 at 3:23 PM, <maximilia...@**googlemail.com> wrote:

Hallo dear all,

I have ingested a few files (more than 700.000) in an Elasticsearch
Index.

The files in the index have to attachement fields where I stored
fulltext data. This field is sometimes really huge.

As I queried my index I missed one special file.

In the attachement filed "ftattach" this document has a text which has
over 800 A4 pages. The word I searched for appears just one single time in
the whole text.

Maybe Elasticsearch (or Lucene) calculates that as not relevant because
of a low score.

Is there any posibility to decrement the necessary ranking-score so I
can find the missing file?

Hope you can help me.

kind regards

--
You received this message because you are subscribed to the Google
Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send
an email to elasticsearc...@**googlegroups.com.

For more options, visit https://groups.google.com/**groups/opt_out https://groups.google.com/groups/opt_out
.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Topic		Replies	Views
Problem with Attachment-Plugin and Highlighting Elasticsearch	14	497	July 6, 2017
ES + Attachment --> indexed documents incomplete Elasticsearch	11	605	July 6, 2017
Newbie quesiton re: document size & score Elasticsearch	3	334	July 6, 2017
Missing Documents Elasticsearch	8	531	July 6, 2017
Remove features Elasticsearch	4	550	July 6, 2017

Question about the ranking in ElasticSearch

Related topics