Ranking result using key-words


(Willian Gigliotti) #1
Hi,

I have read only access to an ElasticSearch 0.90 version with a huge 

collection of Brazilian Official Gazette pages.

We are using that to search for people names like "Gisele Bundchen". 

Our search is something like this:

  • {*
  •       "from": 0,*
    
  •       "size": 2,*
    
  •       "query": {*
    
  •          "query_string": {*
    
  •             "query": "\"Gisele Bundchen\""*
    
  •          }*
    
  •       }*
    
  • }*
And I found 30 pages, for instance. But most of them are useless to us. 

I would love to rank the results based on some key-words, let's say "legal
settlement", "jail" and "lawsuit". So I tried the following search:

  • { "from": 0, "size": 2, "query":
    { "query_string": { "query": ""Gisele
    Bundchen"^5 "legal settlement"^1 jail^2 lawsuit^2"
    } } }*
I got a ranked result, but now I have five million pages instead of 30 

from the first result.

I could use AND instead of OR, but I don't want to risk missing any 

relevant results.

Would you have any ideas on how to fix that?

Thanks a lot!

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/cc4e9d6d-9bdb-48d2-b6f0-29358b17f642%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(Ankush Jhalani) #2

You could do something like
"+"Gisele Bundchen"^5 "legal settlement"^1 jail^2 lawsuit^2" which
would mean results must have Gisele Bundchen while others are optional and
help in ranking results.

On Wednesday, April 16, 2014 4:57:41 PM UTC-4, Willian Gigliotti wrote:

Hi,

I have read only access to an ElasticSearch 0.90 version with a huge 

collection of Brazilian Official Gazette pages.

We are using that to search for people names like "Gisele Bundchen". 

Our search is something like this:

  • {*
  •       "from": 0,*
    
  •       "size": 2,*
    
  •       "query": {*
    
  •          "query_string": {*
    
  •             "query": "\"Gisele Bundchen\""*
    
  •          }*
    
  •       }*
    
  • }*
And I found 30 pages, for instance. But most of them are useless to 

us. I would love to rank the results based on some key-words, let's say
"legal settlement", "jail" and "lawsuit". So I tried the following search:

  • { "from": 0, "size": 2, "query":
    { "query_string": { "query": ""Gisele
    Bundchen"^5 "legal settlement"^1 jail^2 lawsuit^2"
    } } }*
I got a ranked result, but now I have five million pages instead of 30 

from the first result.

I could use AND instead of OR, but I don't want to risk missing any 

relevant results.

Would you have any ideas on how to fix that?

Thanks a lot!

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/72a0949c-61a4-4060-9b61-1d716c312846%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(Willian Gigliotti) #3

Thanks Ankush, that worked!

Em quarta-feira, 16 de abril de 2014 18h01min28s UTC-3, Ankush Jhalani
escreveu:

You could do something like
"+"Gisele Bundchen"^5 "legal settlement"^1 jail^2 lawsuit^2" which
would mean results must have Gisele Bundchen while others are optional and
help in ranking results.

On Wednesday, April 16, 2014 4:57:41 PM UTC-4, Willian Gigliotti wrote:

Hi,

I have read only access to an ElasticSearch 0.90 version with a huge 

collection of Brazilian Official Gazette pages.

We are using that to search for people names like "Gisele Bundchen". 

Our search is something like this:

  • {*
  •       "from": 0,*
    
  •       "size": 2,*
    
  •       "query": {*
    
  •          "query_string": {*
    
  •             "query": "\"Gisele Bundchen\""*
    
  •          }*
    
  •       }*
    
  • }*
And I found 30 pages, for instance. But most of them are useless to 

us. I would love to rank the results based on some key-words, let's say
"legal settlement", "jail" and "lawsuit". So I tried the following search:

  • { "from": 0, "size": 2, "query":
    { "query_string": { "query": ""Gisele
    Bundchen"^5 "legal settlement"^1 jail^2 lawsuit^2"
    } } }*
I got a ranked result, but now I have five million pages instead of 

30 from the first result.

I could use AND instead of OR, but I don't want to risk missing any 

relevant results.

Would you have any ideas on how to fix that?

Thanks a lot!

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/9805dcfd-3c58-4cf5-b9c4-9588dc437847%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(system) #4