Aggregations large data in real time in Elasticsearch. What solution is the best?

ndhcoder · March 5, 2020, 2:10am

Hi all,
I have a question ?

With a system have about a billion logs each days, How can i get all my aggregations data like "count request error per host", "count request per days, per hours, ..", in time series in real time, that mean when i request aggregations, the query can consume old results before to calculating new result for best performance, how we do that in Elasticsearch .. or exist any way better ?

Thanks all. Sorry about my bad english.

warkolm · March 5, 2020, 7:34pm

The basic answer is that Elasticsearch calculates all of that at request time.
It doesn't run these sorts of calculations on a scheduled and then store the results for any requests that are made.

So any request you make will be a real time one.

ndhcoder · March 6, 2020, 7:27am

Hi, Thanks for reply.
What concept of Elasticsearch that i use to implement it ?

Christian_Dahlqvist · March 6, 2020, 7:37am

It sounds like you might be interested in the roll up api. I believe there are videos and. Log posts about it, but do not have links handy at the moment.

dadoonet · March 6, 2020, 7:44am

I think you need to look at aggregations. See https://www.elastic.co/guide/en/elasticsearch/reference/current/search-aggregations.html

ndhcoder · March 6, 2020, 9:07am

Hi, Thanks for suggestion, but i saw the search aggregations, that calculate again each time we searching, so performance seem bad for large data set, i need tracking realtime

dadoonet · March 6, 2020, 9:07am

so performance seem bad for large data set

And did you try it?

ndhcoder · March 6, 2020, 9:10am

No, I read the document, that don't say about any mechanism for cache old results, when my data set is billion document each day, when multi user query for tracking request in realtime. I think it is a bad idea in this case.

ndhcoder · March 6, 2020, 9:11am

The results need to be updated per seconds

dadoonet · March 6, 2020, 10:26am

I don't think it is.
I'd really recommend that you try before saying it's not going to work. You could be very surprised.

ndhcoder · March 6, 2020, 10:30am

Yep. I'll try that. Thanks you

system · April 3, 2020, 10:30am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
ElasticSearch for realtime metrics/KPIs of a system Elasticsearch	9	876	April 22, 2021
Evaluating ElasticSearch: Is it possible to run multiple value aggregations on ~100M records? Elasticsearch	7	799	July 6, 2017
Realtime search structure Elasticsearch	4	310	July 6, 2017
Realtime aggregations per application transaction Elasticsearch aggregations	7	152	March 7, 2024
Multiple aggregation in one request vs one aggregation per request performance Elasticsearch	3	5303	May 8, 2017

Aggregations large data in real time in Elasticsearch. What solution is the best?

Related topics