Es-Hadoop Doc Value Access

chuck · January 29, 2015, 6:52pm

I'm curious about reaching deeper into the Lucene internals with es-hadoop,
in a similar way to how the aggregations module works. While aggregations
are amazing, there are cases where they aren't an ideal solution, mainly
because the data can't be shuffled or repartitioned as it moves through an
analytic. I realize the current implementation can pull single fields by
using an include/exclude on the query, but since this has to go to the
_source, it does not strike me as a performant solution. With an es-spark
interface that could pull doc values/doc IDs in a similar way to how
aggregations do, it would be possible to create arbitrary analytics on any
query context. Has any thought been given to this?
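
For readers unfamiliar with the include/exclude approach mentioned above, here is a minimal sketch of what such a search body could look like, assuming the Elasticsearch 1.x `_source` filtering syntax. The field name `price` and the helper `build_source_filtered_search` are hypothetical and only there for illustration; this is not es-hadoop's own code.

```python
import json


def build_source_filtered_search(query, include, exclude=None):
    """Build a search body that trims each hit's _source to selected fields.

    Assumes the ES 1.x "_source": {"include": [...], "exclude": [...]} form.
    """
    source_filter = {"include": list(include)}
    if exclude:
        source_filter["exclude"] = list(exclude)
    # The values still come from the stored _source document of each hit,
    # which is the cost the post is concerned about.
    return {"query": query, "_source": source_filter}


# "price" is a hypothetical field name.
source_body = build_source_filtered_search({"match_all": {}}, include=["price"])
print(json.dumps(source_body, indent=2))
```

The response is smaller, but the filtering is applied to the `_source` of every hit rather than to a columnar structure.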

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/9093e6de-370d-4f5d-a2ad-1f6f919f9d9f%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

chuck · January 30, 2015, 6:47pm

So I guess I missed that fielddata fields (fielddata_fields) can be
specified in the search request body. That's pretty cool!
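
A minimal sketch of such a request body, assuming the `fielddata_fields` search parameter as it existed in Elasticsearch 1.x (later major versions use `docvalue_fields` instead). The field name `price`, the helper names, and the sample hit are hypothetical; the hit is hand-written to show the general response shape, not captured from a real cluster.

```python
import json


def build_fielddata_search(query, fields):
    """Build a search body that asks for field values from fielddata
    instead of from the stored _source."""
    return {
        "query": query,
        "_source": False,                  # skip returning _source entirely
        "fielddata_fields": list(fields),  # ES 1.x parameter name
    }


def fielddata_values(hit, field):
    """Return the list of values for `field` from one hit's "fields" section,
    or an empty list if the field is absent."""
    return hit.get("fields", {}).get(field, [])


# "price" is a hypothetical field name.
fielddata_body = build_fielddata_search({"match_all": {}}, ["price"])
print(json.dumps(fielddata_body, indent=2))

# Hand-written illustration of a hit; values come back as lists per field.
sample_hit = {"_id": "1", "fields": {"price": [19.99]}}
print(fielddata_values(sample_hit, "price"))
```

A body like this could be passed as the query to whatever client or connector issues the search; how es-hadoop would forward it is not covered here.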

In reply to Elliott Bradshaw's post of Thursday, January 29, 2015 at 1:52:10 PM UTC-5.

