Elasticsearch and data clustering/grouping


(noor) #1

Hi,

My team and I have started an eCommerce project. We are using Elasticsearch
to provide quick search for products. Apart from normal searching, we also
want our products to be grouped or clustered based on the similarity of
their features. E.g., if a user searches for 'Apple', the search results should
show different types of products from Apple, where each search result record
represents a specific type such as iPhone or MacBook. (Note: the product
features vary from product to product.)

My question is whether this can be achieved with Elasticsearch alone, or
whether we have to use other projects like Mahout or Weka to cluster the
similar products in the search results. If so, please suggest how we can
integrate Mahout/Weka with Elasticsearch.

It would be great if anyone with experience in this area could give
us suggestions for addressing this problem.

Thanks,

Noor

--


(Ivan Brusic) #2

Noor,

You can aggregate results using facets, which provide functionality
similar to GROUP BY in SQL.
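
As a rough illustration of the facet approach, here is a minimal sketch that builds a search body with a terms facet and reads the per-term counts back out of a response. The field name `product_type` and both helper functions are hypothetical, and the sketch assumes the field is indexed so that each product type is a single term.

```python
import json


def build_facet_search(query_text, facet_field, size=10):
    """Build a search body that also requests a terms facet.

    The facet counts matching documents per distinct value of facet_field.
    The field name is an assumption; use whatever your mapping defines.
    """
    return {
        "query": {"query_string": {"query": query_text}},
        "facets": {
            facet_field: {"terms": {"field": facet_field, "size": size}}
        },
    }


def facet_counts(response, facet_name):
    """Extract (term, count) pairs from a terms facet in a search response."""
    facet = response.get("facets", {}).get(facet_name, {})
    return [(entry["term"], entry["count"]) for entry in facet.get("terms", [])]


# Request body to send to the _search endpoint of the product index.
body = build_facet_search("apple", "product_type")
print(json.dumps(body, indent=2))
```

The counts tell you how many matching products fall under each type, but the hits themselves still come back as a flat list.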

True grouping, also called field collapsing, is not [yet]
supported in ElasticSearch (even Lucene support is not consistent).
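
Until then, one possible workaround is to collapse the hits on the client side. The sketch below is only an approximation of field collapsing, not a built-in feature: `collapse_hits` and the `product_type` field are hypothetical names, and it only collapses the hits already fetched, so totals and paging will not reflect the grouping.

```python
def collapse_hits(hits, group_field):
    """Keep the first (highest-ranked) hit for each value of group_field.

    hits is the list found under response["hits"]["hits"], assumed to be
    sorted by score already. Hits lacking the field are kept as-is.
    """
    seen = set()
    collapsed = []
    for hit in hits:
        key = hit.get("_source", {}).get(group_field)
        if key is None:
            # No group value: treat the hit as its own group.
            collapsed.append(hit)
        elif key not in seen:
            seen.add(key)
            collapsed.append(hit)
    return collapsed


# Made-up hits in the shape of a search response, for illustration only.
sample_hits = [
    {"_id": "1", "_source": {"name": "iPhone 4S", "product_type": "iphone"}},
    {"_id": "2", "_source": {"name": "iPhone 4", "product_type": "iphone"}},
    {"_id": "3", "_source": {"name": "MacBook Air", "product_type": "macbook"}},
]
for hit in collapse_hits(sample_hits, "product_type"):
    print(hit["_source"]["name"])
```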

--

Ivan

On Tue, Sep 4, 2012 at 11:16 PM, Noor nooremailbox@gmail.com wrote:

--

