Group by and sum operation for fields


(Zekeriya Kaplan) #1

First of all, I want to tell you that I am new to Elasticsearch.

I have a JSON document in the following format. I want to group by the "CallTo" field and sum the "Count" value for each "CallTo". I will use these values for visualization in Kibana.

My first question is: what should my index (mapping) look like for the group-by and sum operation I mentioned? And my second question is: what is the query?

My JSON:

{
  "Labels": "qwerty",
  "Type": "type1",
  "Id": "id12345",
  "FieldName": [
    {
      "CallTo": "Tom",
      "Count": 2
    },
    {
      "CallTo": "Jessica",
      "Count": 4
    },
    {
      "RegionCode": "US",
      "Count": 1
    },
    {
      "RegionCode": "DE",
      "Count": 5
    },
    {
      "CallCategory": "K1",
      "Count": 6
    }
  ],
  "OtherField": [
    {
      "Key": "bin5",
      "Value": 0
    },
    {
      "Key": "bin1",
      "Value": 0
    },
    {
      "Key": "bin3",
      "Value": 2
    },
    {
      "Key": "binOther",
      "Value": 0
    }
  ],
  "XField": [
    {
      "Key": "bin50000",
      "Value": 1
    },
    {
      "Key": "bin10000",
      "Value": 3
    },
    {
      "Key": "bin30000",
      "Value": 4
    },
    {
      "Key": "binOther",
      "Value": 7
    }
  ]
}

My expected result would be something like this:

[
  {
    "CallTo": "Tom",
    "Count": 23
  },
  {
    "CallTo": "Jessica",
    "Count": 44
  },
  {
    "RegionCode": "US",
    "Count": 18
  },
  {
    "RegionCode": "DE",
    "Count": 58
  },
  {
    "CallCategory": "K1",
    "Count": 46
  }
]
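With the array format above, one possible approach (a sketch only; the index name `myindex` is made up, the mapping syntax shown is for Elasticsearch 7.x, and it assumes `FieldName` is mapped as `nested` so each `CallTo`/`Count` pair stays associated) is a `nested` mapping plus a nested `terms` aggregation with a `sum` sub-aggregation:

```json
PUT myindex
{
  "mappings": {
    "properties": {
      "FieldName": {
        "type": "nested",
        "properties": {
          "CallTo":       { "type": "keyword" },
          "RegionCode":   { "type": "keyword" },
          "CallCategory": { "type": "keyword" },
          "Count":        { "type": "integer" }
        }
      }
    }
  }
}

POST myindex/_search
{
  "size": 0,
  "aggs": {
    "calls": {
      "nested": { "path": "FieldName" },
      "aggs": {
        "by_call_to": {
          "terms": { "field": "FieldName.CallTo" },
          "aggs": {
            "total_count": { "sum": { "field": "FieldName.Count" } }
          }
        }
      }
    }
  }
}
```

Note that nested fields and nested aggregations are awkward to use from the standard Kibana visualizations, which is one argument for flattening the data instead.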

Also, I am open to alternative solutions, even changing the JSON format.

Thanks


(David Pilato) #2

I wonder if you should index every single "call" rather than a list of calls.
But for that, it would be better to explain what the use case is about and what the documents represent, IMO.


(Zekeriya Kaplan) #3

It is another possible solution, but I guess this approach may cause some performance problems or other issues (I can't be sure about that because I am new to Elasticsearch).

The FieldName array size could be in the millions. If I index every single "call", will that cause any problems?

Can you show your solution for the input JSON, please?

Thanks


(David Pilato) #4

I guess this approach may cause some performance problems

I think the total opposite.


(Zekeriya Kaplan) #5

What is your input JSON format, then?


(David Pilato) #6

I'm not sure, because I don't know a lot about your data. But something like:

PUT calls/doc/1
{
  "Labels": "qwerty",
  "Type": "type1",
  "Id": "id12345",
  "CallTo": "Tom",
  "RegionCode": "US",
  "CallCategory": "K1"
}
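With one document per call, the group-by-and-sum reduces to a plain `terms` aggregation (a sketch; the index name `calls` follows the example above, and `CallTo` is assumed to be mapped as `keyword`):

```json
POST calls/_search
{
  "size": 0,
  "aggs": {
    "by_call_to": {
      "terms": { "field": "CallTo" }
    }
  }
}
```

Since each call is its own document, the `doc_count` of each bucket is already the per-callee total, and Kibana can build this kind of visualization directly without any custom query.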

(Zekeriya Kaplan) #7

First of all, I really appreciate your answer, but your proposition is not suitable for my case.
Let me explain in more depth what my input JSON (attached above) means.

I work on VoIP call events to extract some valuable data over a time interval. For each specified time interval I run an aggregation policy to produce the JSON; I mean, storing the aggregation result JSON is more important than storing every call event.


(David Pilato) #8

For each specified time interval I run an aggregation policy to produce the JSON; I mean, storing the aggregation result JSON is more important than storing every call event.

Why not do the aggregation in real time in Elasticsearch instead, using whatever period you want?
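For example, if the raw calls were indexed individually as above, a `date_histogram` could compute the per-period counts at query time (a sketch; it assumes each call document carries a `timestamp` date field, which is not in the thread's example, and `fixed_interval` is the Elasticsearch 7.2+ parameter name, earlier versions use `interval`):

```json
POST calls/_search
{
  "size": 0,
  "aggs": {
    "per_period": {
      "date_histogram": { "field": "timestamp", "fixed_interval": "1h" },
      "aggs": {
        "by_call_to": {
          "terms": { "field": "CallTo" }
        }
      }
    }
  }
}
```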


(Zekeriya Kaplan) #9

The aggregation process is out of my control. I just get the JSON, which I index into Elasticsearch for group-by and sum operations, etc. My responsibility is only the output JSONs (attached above), so we should focus on those.


(David Pilato) #10

I thought you said that you could control how the data is produced:

Also, I am open to alternative solutions, even changing the JSON format.

Can't you?
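If the upstream aggregation output can change, another option (a sketch; the index name `calltotals` and the request format following the thread's typed `doc` example are assumptions) is to flatten each bucket of the aggregation result into its own document, one per `CallTo`/`Count` pair per interval:

```json
PUT calltotals/doc/1
{
  "Labels": "qwerty",
  "Type": "type1",
  "CallTo": "Tom",
  "Count": 2
}
```

Summing across intervals is then a `terms` aggregation with a `sum` sub-aggregation (assuming `CallTo` is mapped as `keyword`):

```json
POST calltotals/_search
{
  "size": 0,
  "aggs": {
    "by_call_to": {
      "terms": { "field": "CallTo" },
      "aggs": {
        "total_count": { "sum": { "field": "Count" } }
      }
    }
  }
}
```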


(Zekeriya Kaplan) #11

I used this approach, and it works for me now =)
Thanks


(system) #13

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.