Significant field aggregation ElasticSearch

Vincent_Stevenson · March 21, 2019, 4:21pm

Is there something similar to ElasticSearch's significant_text aggregation, but for significant_fields?

Example: based on the foreground of Json documents, there is an uncommonly common occurrence of the field "RAM capacity" when doing a query on computers. Is there an aggregation or Json query type that will return those uncommonly common field names (RAM capacity, CPU type, HDD capacity, etc)?

Thanks!

Mark_Harwood · March 21, 2019, 5:06pm

Nothing out of the box.
A couple of options spring to mind:

Do-it-yourself in the client-side by performing the calculation on foreground/background numbers obtained from a combination of the global agg (for background stats) and the filters agg with exists queries for the fields of interest
Re-index the content with a fieldnames array of the type keyword that lists the fields available on a doc. Using significant_terms agg to compute the significance of values.

Vincent_Stevenson · March 21, 2019, 6:03pm

Thanks Mark!

Mark_Harwood · March 21, 2019, 7:13pm

FYI the field exists query and the adjacency matrix agg can be used together to show how doc types and fields relate: https://youtu.be/JzHRuJOCnR0

system · April 18, 2019, 7:13pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Can Significant text aggregation work on copy_to fields Elasticsearch	3	376	May 11, 2021
Significant text aggregation with custom analyzer in Elasticsearch 6 Elasticsearch	6	1065	December 21, 2017
Significant Terms Aggregation to many fields Elasticsearch	2	464	February 16, 2018
Significant_term for other metrics? Elasticsearch	4	352	July 6, 2017
Aggregation with significant_text on wildcard fields Elasticsearch	2	1457	December 17, 2020

Significant field aggregation ElasticSearch

Related topics