I have some significant skew in document sizes in a particular index. Some
docs can be 1k and some can be as large~3g. However, the number of very
large documents is very small. As a result, any queries that match the
large documents require streaming the entire document from disk. This
results in extremely long search response latencies. The situation is
compounded since these large documents usually get hit together, and
typically end up on the same page.
Ideally, I would like to use exclude on _source on a per document basis
when indexing as specifying this in the mapping is too general.
Additionally, I would like the exclusion to take place *after *indexing but
before being stored. Is there any way to achieve this effect, perhaps using
an update request?
Secondly, when using a mapping with _source excludeshttp://www.elasticsearch.org/guide/reference/mapping/source-field/,
does this exclusion happen before or after indexing? Is this configurable?
Thank you advance,
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to email@example.com.
For more options, visit https://groups.google.com/groups/opt_out.