Can i filter Facets by count>1


(Weiwei Wang) #1

I need to return facet terms whose count>1, how could i achieve it?

Currently I can retrieve all the facets and filter out those whose
frequency=1, is there any better approach?


(Cédric EXBRAYAT) #2

I've got a similar problem.

In SolR, there is something like minCount on facets
http://wiki.apache.org/solr/SimpleFacetParameters#facet.mincount

Am I missing something in elasticsearch to do that ?

On 13 juin, 05:58, Weiwei Wang ww.wang...@gmail.com wrote:

I need to returnfacetterms whosecount>1, how could i achieve it?

Currently I can retrieve all the facets and filter out those whose
frequency=1, is there any better approach?


(Shay Banon) #3

Which facet are you referring to, the terms one?

On Thursday, June 23, 2011 at 10:41 AM, Cedric wrote:

I've got a similar problem.

In SolR, there is something like minCount on facets
http://wiki.apache.org/solr/SimpleFacetParameters#facet.mincount

Am I missing something in elasticsearch to do that ?

On 13 juin, 05:58, Weiwei Wang <ww.wang...@gmail.com (http://gmail.com)> wrote:

I need to returnfacetterms whosecount>1, how could i achieve it?

Currently I can retrieve all the facets and filter out those whose
frequency=1, is there any better approach?


(Weiwei Wang) #4

hi, shay,
i'm using TermsFacet to find the duplicate documents based on
certain field.

I have another question is that, i want to know the count of each term
and the ids associated with this term, the current version of ES
support this kind of operation?

On Jun 23, 7:56 pm, Shay Banon shay.ba...@elasticsearch.com wrote:

Which facet are you referring to, the terms one?

On Thursday, June 23, 2011 at 10:41 AM, Cedric wrote:

I've got a similar problem.

In SolR, there is something like minCount on facets
http://wiki.apache.org/solr/SimpleFacetParameters#facet.mincount

Am I missing something in elasticsearch to do that ?

On 13 juin, 05:58, Weiwei Wang <ww.wang...@gmail.com (http://gmail.com)> wrote:

I need to returnfacetterms whosecount>1, how could i achieve it?

Currently I can retrieve all the facets and filter out those whose
frequency=1, is there any better approach?


(Cédric EXBRAYAT) #5

In my case, it's a date histogram (but it can be any facet), and we'd like
to return only the facets with a minimum number of documents.
(for example : return the users with at least 5 tweets during a week).

The 'at least 5' part is the problem : we can filter manually but I was
wondering if something can do the job in ES (as it can be done in SolR with
a minCount).

On Thu, Jun 23, 2011 at 2:22 PM, Weiwei Wang ww.wang.cs@gmail.com wrote:

hi, shay,
i'm using TermsFacet to find the duplicate documents based on
certain field.

I have another question is that, i want to know the count of each term
and the ids associated with this term, the current version of ES
support this kind of operation?

On Jun 23, 7:56 pm, Shay Banon shay.ba...@elasticsearch.com wrote:

Which facet are you referring to, the terms one?

On Thursday, June 23, 2011 at 10:41 AM, Cedric wrote:

I've got a similar problem.

In SolR, there is something like minCount on facets
http://wiki.apache.org/solr/SimpleFacetParameters#facet.mincount

Am I missing something in elasticsearch to do that ?

On 13 juin, 05:58, Weiwei Wang <ww.wang...@gmail.com (http://gmail.com)>
wrote:

I need to returnfacetterms whosecount>1, how could i achieve it?

Currently I can retrieve all the facets and filter out those whose
frequency=1, is there any better approach?


(Cédric EXBRAYAT) #6

Shoul I open a proposal for a new feature on github or am I missing
something ?

(Be able to filter facets with a minimum number of documents to request for
example 'the users with at least 5 tweets during a week')

On Thu, Jun 23, 2011 at 3:32 PM, Cédric EXBRAYAT cexbrayat@gmail.comwrote:

In my case, it's a date histogram (but it can be any facet), and we'd like
to return only the facets with a minimum number of documents.
(for example : return the users with at least 5 tweets during a week).

The 'at least 5' part is the problem : we can filter manually but I was
wondering if something can do the job in ES (as it can be done in SolR with
a minCount).

On Thu, Jun 23, 2011 at 2:22 PM, Weiwei Wang ww.wang.cs@gmail.com wrote:

hi, shay,
i'm using TermsFacet to find the duplicate documents based on
certain field.

I have another question is that, i want to know the count of each term
and the ids associated with this term, the current version of ES
support this kind of operation?

On Jun 23, 7:56 pm, Shay Banon shay.ba...@elasticsearch.com wrote:

Which facet are you referring to, the terms one?

On Thursday, June 23, 2011 at 10:41 AM, Cedric wrote:

I've got a similar problem.

In SolR, there is something like minCount on facets
http://wiki.apache.org/solr/SimpleFacetParameters#facet.mincount

Am I missing something in elasticsearch to do that ?

On 13 juin, 05:58, Weiwei Wang <ww.wang...@gmail.com (
http://gmail.com)> wrote:

I need to returnfacetterms whosecount>1, how could i achieve it?

Currently I can retrieve all the facets and filter out those whose
frequency=1, is there any better approach?


(Shay Banon) #7

Yea, you can open a feature request.

On Monday, June 27, 2011 at 11:13 AM, Cédric EXBRAYAT wrote:

Shoul I open a proposal for a new feature on github or am I missing something ?

(Be able to filter facets with a minimum number of documents to request for example 'the users with at least 5 tweets during a week')

On Thu, Jun 23, 2011 at 3:32 PM, Cédric EXBRAYAT <cexbrayat@gmail.com (mailto:cexbrayat@gmail.com)> wrote:

In my case, it's a date histogram (but it can be any facet), and we'd like to return only the facets with a minimum number of documents.
(for example : return the users with at least 5 tweets during a week).

The 'at least 5' part is the problem : we can filter manually but I was wondering if something can do the job in ES (as it can be done in SolR with a minCount).

On Thu, Jun 23, 2011 at 2:22 PM, Weiwei Wang <ww.wang.cs@gmail.com (mailto:ww.wang.cs@gmail.com)> wrote:

hi, shay,
i'm using TermsFacet to find the duplicate documents based on
certain field.

I have another question is that, i want to know the count of each term
and the ids associated with this term, the current version of ES
support this kind of operation?

On Jun 23, 7:56 pm, Shay Banon <shay.ba...@elasticsearch.com (mailto:shay.ba...@elasticsearch.com)> wrote:

Which facet are you referring to, the terms one?

On Thursday, June 23, 2011 at 10:41 AM, Cedric wrote:

I've got a similar problem.

In SolR, there is something like minCount on facets
http://wiki.apache.org/solr/SimpleFacetParameters#facet.mincount

Am I missing something in elasticsearch to do that ?

On 13 juin, 05:58, Weiwei Wang <ww.wang...@gmail.com (mailto:ww.wang...@gmail.com) (http://gmail.com)> wrote:

I need to returnfacetterms whosecount>1, how could i achieve it?

Currently I can retrieve all the facets and filter out those whose
frequency=1, is there any better approach?


(system) #8