How to improve facet performance?

Min_Cha · October 18, 2013, 5:05am

https://groups.google.com/forum/#!topic/elasticsearch/3j_r5uc_7F4

Hello.
I had the OOM problem and resolved by setting option
'index.cache.field.type: soft'.
At now, my query and facet works file.

But I have faced new problem, pool performance.

For example,

Req : {"size":0,"query":{"field":{"follows":"2324"}},"facets":{"
plays":{"terms":{"size":10,"script":"doc['plays.musicId'].values"}}}}
Res : { took : 169612 ... }

For testing, I changed facet field as following.

Req : {"size":0,"query":{"field":{"follows":"2324"}},"facets":{"
plays":{"terms":{"size":10,"script":"doc['plays.count'].values"}}}}
Res : {took : 5 ...}

This change makes performance good dramatically.
The difference between 'musicId' and 'count' is one.

The count value of all docs is 1.(constant)
The musicId value of each doc is between 1 and 50000 and each doc has
300 musicIds.

There is an way to improve performance?
Please, give me some advice.
Thanks for reading.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Clinton_Gormley · October 18, 2013, 5:34pm

Hiya

Hello.

I had the OOM problem and resolved by setting option
'index.cache.field.type: soft'.
At now, my query and facet works file.

But I have faced new problem, pool performance.

That's what soft gives you By using soft references, you haven't
solved the OOM problem, you've just forced ES to reload field data all the
time (which is very heavy).

You can set the indices.fielddata.cache.size size to avoid OOMs, but it's a
safety mechanism, not a solution. If you are running out of memory it'll
evict field data, which will affect performance. See

You need more memory, or more nodes, or fewer facets. If you're faceting on
high cardinality string fields, that's going to use a lot of memory.

clint

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Ivan · October 18, 2013, 6:07pm

Instead of using scripts, can you simply index the eventually value of
plays.count? You indexing process would have to contain more logic, but
this logic is only executed once per document and not each time like it is
during a query.

Can you provide some sample documents?

Cheers,

Ivan

On Fri, Oct 18, 2013 at 10:34 AM, Clinton Gormley clint@traveljury.comwrote:

Hiya

Hello.

I had the OOM problem and resolved by setting option
'index.cache.field.type: soft'.
At now, my query and facet works file.

But I have faced new problem, pool performance.

That's what soft gives you By using soft references, you haven't
solved the OOM problem, you've just forced ES to reload field data all the
time (which is very heavy).

You can set the indices.fielddata.cache.size size to avoid OOMs, but it's
a safety mechanism, not a solution. If you are running out of memory it'll
evict field data, which will affect performance. See
Elasticsearch Platform — Find real-time answers at scale | Elastic
You need more memory, or more nodes, or fewer facets. If you're faceting
on high cardinality string fields, that's going to use a lot of memory.

clint

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Min_Cha · October 21, 2013, 12:49pm

Thanks for kindly advice.

I have posted
Redirecting to Google Groups.
This page contains samples and specific queries.

Thanks again!

2013년 10월 19일 토요일 오전 3시 7분 3초 UTC+9, Ivan Brusic 님의 말:

Instead of using scripts, can you simply index the eventually value of
plays.count? You indexing process would have to contain more logic, but
this logic is only executed once per document and not each time like it is
during a query.

Can you provide some sample documents?

Cheers,

Ivan

On Fri, Oct 18, 2013 at 10:34 AM, Clinton Gormley <cl...@traveljury.com<javascript:>

wrote:

Hiya

Hello.

I had the OOM problem and resolved by setting option
'index.cache.field.type: soft'.
At now, my query and facet works file.

But I have faced new problem, pool performance.

That's what soft gives you By using soft references, you haven't
solved the OOM problem, you've just forced ES to reload field data all the
time (which is very heavy).

You can set the indices.fielddata.cache.size size to avoid OOMs, but it's
a safety mechanism, not a solution. If you are running out of memory it'll
evict field data, which will affect performance. See
Elasticsearch Platform — Find real-time answers at scale | Elastic
You need more memory, or more nodes, or fewer facets. If you're faceting
on high cardinality string fields, that's going to use a lot of memory.

clint

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearc...@googlegroups.com <javascript:>.
For more options, visit https://groups.google.com/groups/opt_out.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.

Topic		Replies	Views
Kniffelig facet performance Elasticsearch	1	251	July 6, 2017
How to improve performance of facet queries? Elasticsearch	7	1291	July 6, 2017
Why to cause OOM when searching with query and facet? Elasticsearch	3	701	July 6, 2017
How to improve facet search performance? Elasticsearch	4	436	July 6, 2017
Facets performance issue Elasticsearch	4	334	July 6, 2017

How to improve facet performance?

Related topics