I'm not sure whether you have one or multiple questions but it's perfectly fine to use ES for both storage and search.
You can use HDFS as a snapshot/backup store to further improve the resilience of your system.
Millions of documents is not an issue for ES
On 1/29/15 4:29 PM, Manoj Singh wrote:
Hi,
I have one question related to performance of ES with Hadoop.
Our Architecture:
use hadoop for storage big data as we have millions of data.
Feed to ES from Hadoop via API.
Search will work through ES.
Will this architecture have performance issue ?
OR We simple use ES for millions storage and search.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.