Top 10 Filtering Appears inconsistent

daniel.eaton · March 18, 2019, 9:16pm

Hi All,

Just had a Visualisation/Functionality question regarding Kibana's "Show Terms" functionality.

I am visualising the top 5 Customers (Text, Keyword) by Transaction Amount Total (Sum, Number).

When I limit the number of customers to be displayed to 5, (Based on Descending Sum of Transaction Amount). I get:
Customer A (with 100k),
Customer B (with 90k),
Customer C (with 80k),
Customer D (with 70k),
Customer E (with 60k)
(Aliased for privacy reasons).

However, When I expand this to double check my working, to the top 1000, the first 5 entries are no longer A, B, C, D, E, and are instead
Customer A (100k),
Customer J (95k),
Customer B (90k),
Customer M (88k),
Customer Z (85k)

Any Idea why I would be getting incorrect information when i reduce the size of the Terms shown? I am using Customer_Name.Keyword field as per below,

"Customer_Name": {
              "type": "text", 
              "fields": 
              {
                "keyword":
                {
                 "type":"keyword" 
                }
              }
            },

lukas · March 20, 2019, 5:03pm

Hmm, that's definitely strange. Have you looked at the request/response from Elasticsearch to see if that's the data coming back from Elasticsearch?

Christian_Dahlqvist · March 20, 2019, 5:30pm

If you have a field with quite high cardinality I believe this is expected as terms aggregations are approximate. How many customers do you have in the index? How many shards is this data distributed across?

daniel.eaton · March 20, 2019, 6:20pm

1 Shard, 1 Replicate
1000-2000 customers - 100,000 documents

Christian_Dahlqvist · March 20, 2019, 6:53pm

If all the data is in a single index with a single shard it sounds strange that it changes. Do the shards have the same number of documents if you look at the _cat/shards API?

daniel.eaton · March 20, 2019, 7:07pm

Sorry, I stand corrected, It somehow ended up on 5 shards (Primary)
get _cat/shards
indx 2 p STARTED 20110 6.1mb 172.30.60.11 jpK1TfI
indx 2 r UNASSIGNED
indx 3 p STARTED 19863 5.7mb 172.30.60.11 jpK1TfI
indx 3 r UNASSIGNED
indx 1 p STARTED 20126 6.1mb 172.30.60.11 jpK1TfI
indx 1 r UNASSIGNED
indx 4 p STARTED 19770 5.7mb 172.30.60.11 jpK1TfI
indx 4 r UNASSIGNED
indx 0 p STARTED 20131 5.7mb 172.30.60.11 jpK1TfI
indx 0 r UNASSIGNED

daniel.eaton · March 20, 2019, 7:41pm

Was a difference because of the shards causing inaccuracies, resolved it !

system · April 17, 2019, 7:41pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
[ES 5.0] Mapping issues Elasticsearch	2	1021	December 8, 2016
Why my elasticsearch showing different counts for field and field.keyword? Elasticsearch	2	458	April 6, 2017
Kibana visualization showing data as "Other" Kibana	7	3138	September 28, 2021
Kibana Lens Visualization showing data as "other" where as I have values for that fields Kibana	4	1344	October 12, 2021
I got diffirent result with same filter between unique_count and list-group-terms in kibana Kibana	3	211	April 26, 2023

Top 10 Filtering Appears inconsistent

Related topics