Say if I want to trigger an event, that whenever a record is added to
Elastic Search engine it also gets added to Mahout, Mahout actually
doesn't understand the indexed files. Is it possible to index the
records in ES in another format like csv, json, etc? Since ES has
emerged from Lucene, I believe there should be a way to do it.
I would really appreciate any information on this topic.
Aren't rivers what you need?
I didn't fully get what they are, but I see them as a way to plug another
source/end to ES.
If I'm right, you could write a river that would notify Mahout of any new
documents indexed in ES.
I scanned your page quickly, but I think it is what is done with Solr.
I am facing problem integrating Elastic Search with Mahout because in
ES am not able to control the format in which the files are indexed.
For example in Solr or Lucene, following links
Say if I want to trigger an event, that whenever a record is added to
Elastic Search engine it also gets added to Mahout, Mahout actually
doesn't understand the indexed files. Is it possible to index the
records in ES in another format like csv, json, etc? Since ES has
emerged from Lucene, I believe there should be a way to do it.
I would really appreciate any information on this topic.
Docs are stored as Json in ES already. All you need to do is execute search
in ES, get the source from the docs as json, and then push the data to
Mahout.
Have you created in index in ES and run queries against it?
Regards,
Berkay Mollamustafaoglu
mberkay on yahoo, google and skype
On Mon, May 23, 2011 at 10:17 AM, Olivier Favre olivier@yakaz.com wrote:
Aren't rivers what you need?
I didn't fully get what they are, but I see them as a way to plug another
source/end to ES.
If I'm right, you could write a river that would notify Mahout of any new
documents indexed in ES.
I scanned your page quickly, but I think it is what is done with Solr.
I am facing problem integrating Elastic Search with Mahout because in
ES am not able to control the format in which the files are indexed.
For example in Solr or Lucene, following links
Say if I want to trigger an event, that whenever a record is added to
Elastic Search engine it also gets added to Mahout, Mahout actually
doesn't understand the indexed files. Is it possible to index the
records in ES in another format like csv, json, etc? Since ES has
emerged from Lucene, I believe there should be a way to do it.
I would really appreciate any information on this topic.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.