Query datastream - all shards - how to avoid?

abenas · August 24, 2020, 1:43pm

Hi,

We currently use a date pattern in index name to contain a monthly breakdown of documents. In this way, an app can query the right index (or indexes) based on an initial/end date, avoiding hitting all indexes / shards.

Is it possible to do something like this with DataStream? Is there some strategy to avoid hitting all shards based on a date criteria? (like routing by @timestamp)

Christian_Dahlqvist · August 24, 2020, 5:11pm

Kibana used to do this, but that was later removed as hitting all indices was made a lot more efficient. Have you tested how much difference it makes in your use case, e.g. clear caches and hit all indices and then compare that to clearing caches and only hitting the required indices?

abenas · August 24, 2020, 5:51pm

I'll perform this test.

In the past, we've noticed high load and cpu usage in the cluster when multiple search requests are made by clients that hit many indexes. We solve this scenario by building the search url dynamically, using the date parameters, so the requests just hit the right indexes.

system · September 21, 2020, 5:51pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Migrate to datastreams withou aliases + datastreams filter Elasticsearch datastreams	4	322	February 16, 2023
Date Index + Alias question Elasticsearch	5	883	December 30, 2017
Automatic skipping of indexes / shards for date-based indexing and index sorting Elasticsearch	3	300	November 22, 2022
How search in datastream by timestamp works? Elasticsearch datastreams	5	592	June 28, 2022
Avoid hiting unecessary shards with wildcard dated index Elasticsearch	3	188	April 22, 2024

Query datastream - all shards - how to avoid?

Related topics