Term filter vs multiple indices

Eldad_Moneta · March 20, 2017, 5:51pm

Hi,

I have millions of metrics for ~1 hundred sources (the number of sources will raise in the future).
A query is always for a single source.
I'm already splitting the indices per day.

I'm trying to decide between multiple indices - one per source vs a single index with the source as a term in the document (in this case, all queries will use a term filter with the source id).

Is there a performance difference between the 2 above? during index? during query?

The cluster is write heavy- thousands of index requests (that translate to tens of bulk requests) per second.

Thanks.

dadoonet · March 20, 2017, 6:06pm

If whatever the source all docs have same fields, I'd try with one single index.

If not, I'd try to avoid sparse fields and would split into separate indices.

Eldad_Moneta · March 21, 2017, 7:56am

Thank you @dadoonet.
All docs are of the same type and have the same fields so I guess the answer is single index.
Does this answer based on better maintainability, or there are performance considerations as well?

dadoonet · March 21, 2017, 8:17am

Sharing the same Lucene instances will consume less resources. That's why I'd probably mix that.

system · April 18, 2017, 8:18am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Guidance on Using Multiple Indexes vs One Index for Time Series Data from Multiple Sources Elasticsearch	11	9327	July 5, 2017
Single Index vs. Multiple Indices Elasticsearch	9	4856	November 25, 2018
How is performance affected on distribution of data over multiple indices Elasticsearch	3	304	July 6, 2017
One index vs multiple indexes? Elasticsearch	7	5218	February 26, 2019
One index or multiple index for exact same mapping, but the data are clustered based on a field Elasticsearch	2	48	May 20, 2025

Term filter vs multiple indices

Related topics