I have a project that lets brands monitor their brand names. I want
to store the data for each brand in a separate index. Is there a limit
on the number of indexes that Elasticsearch can realistically handle?
Will ES still work if I have 10k - 100k indexes? Each index will
receive around 100k new rows every month.
I want to store the data separately because we always select data for
only one brand, and I want to take advantage of _open/_close on the index.
See this thread: Redirecting to Google Groups, but in general, it's pretty expensive to have many indices (even with one shard).
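To put rough numbers on "expensive", here is a minimal back-of-the-envelope sketch. It only multiplies the index counts from the question by a shard layout. The function name `total_shards` is hypothetical, and the 5-primary/1-replica layout is an assumption about a typical default of that era, not something stated in this thread. Each shard copy is its own Lucene index with its own files and memory overhead, which is why the count matters.

```python
def total_shards(indices: int, primaries: int = 1, replicas: int = 0) -> int:
    """Number of shard copies the cluster must host.

    Every primary shard and every replica copy is a separate Lucene index.
    """
    return indices * primaries * (1 + replicas)


ROWS_PER_INDEX_PER_MONTH = 100_000  # figure taken from the question

for n in (10_000, 100_000):
    minimal = total_shards(n)                          # one shard, no replicas
    assumed = total_shards(n, primaries=5, replicas=1) # assumed layout, see above
    rows = n * ROWS_PER_INDEX_PER_MONTH
    print(f"{n} indices: {minimal} shards minimum, "
          f"{assumed} shards with 5 primaries + 1 replica, "
          f"{rows} new rows per month in total")
```

Even in the best case of one shard per index, the cluster has to track as many Lucene indices as there are brands.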
On Tuesday, February 28, 2012 at 12:48 PM, Damian Brhel wrote:
[the question above, quoted in full]
On Wednesday, February 29, 2012 8:53:16 AM UTC-5, kimchy wrote:
See this thread: Redirecting to Google Groups,
but in general, it's pretty expensive to have many indices (even with one
shard).
Could you elaborate a bit on the overhead of indices? A single
index per object gives us a perfect setup for our dataflow, but how far
can this reasonably scale? Though we're starting out at ~20,000 indices,
I'd like to be able to plan for 1 million if possible.
If this isn't too far-fetched, which is the more important scaling factor:
node size or node count?
That is indeed far-fetched, unless you plan on having hundreds of
thousands of servers...
However, filtered aliases might be what you are after.
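As a rough illustration of the filtered-alias idea, the sketch below builds the JSON body for a request to the `_aliases` endpoint: one shared index, plus one alias per brand that carries a term filter and a routing value. The index name `brands`, the field `brand_id`, the `brand_<id>` alias naming, and both helper functions are hypothetical and chosen only for this example. The code just prints the body and leaves sending it to your HTTP client of choice.

```python
import json


def brand_alias_action(brand_id: int, shared_index: str = "brands") -> dict:
    """One 'add' action: an alias that only sees a single brand's documents."""
    return {
        "add": {
            "index": shared_index,            # hypothetical shared index
            "alias": f"brand_{brand_id}",     # hypothetical naming scheme
            "filter": {"term": {"brand_id": brand_id}},
            "routing": str(brand_id),         # keep a brand's docs on one shard
        }
    }


def aliases_request_body(brand_ids) -> dict:
    """Body for POST /_aliases covering several brands at once."""
    return {"actions": [brand_alias_action(b) for b in brand_ids]}


body = aliases_request_body([1, 2])
print(json.dumps(body, indent=2))
```

Searching against `brand_1` then behaves much like searching a dedicated index, while the cluster only manages the shards of the shared index. One trade-off to keep in mind: `_open`/`_close` operate on whole indices, so they would no longer apply per brand.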
I suggest taking a look at Shay's presentation at Berlin Buzzwords this
year: