I am working with Reddit comments and want to identify bot accounts for analysis. What I'd like to do is:
- Find all unique authors who have at least one comment whose reply_delay is less than a specific value. Example:
curl -s -H "Content-Type: application/json" -XGET 'localhost:9200/rc/_search?q=reply_delay:<1' -d '{"size":0,"aggs":{"authors":{"terms":{"field":"author.keyword"}}}}'
- From this set of authors, I want to pull all documents written by those authors and then run new aggregations over that document set.
So essentially I'm trying to:
- Use a query to limit the initial document set and run a terms aggregation on it.
- From the buckets of that aggregation, build a new document set that I can run additional aggregations against.
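For concreteness, here is a rough sketch of the two steps done client-side as two separate requests. It only builds the request bodies and parses a response, so it runs without a cluster. The function names, the max_authors parameter, the avg_reply_delay aggregation and the sample response are all made up for illustration; the field names reply_delay and author.keyword come from the curl example above. The range query is assumed to be equivalent to q=reply_delay:<1.

```python
import json


def fast_reply_authors_body(max_delay, max_authors=1000):
    """Step 1: keep comments with reply_delay < max_delay, bucket them by author."""
    return {
        "size": 0,  # no hits needed, only the aggregation buckets
        "query": {"range": {"reply_delay": {"lt": max_delay}}},
        "aggs": {
            "authors": {
                # A terms aggregation returns only 10 buckets unless size is set.
                "terms": {"field": "author.keyword", "size": max_authors}
            }
        },
    }


def author_keys(response):
    """Extract the author names from the step 1 terms buckets."""
    return [b["key"] for b in response["aggregations"]["authors"]["buckets"]]


def all_comments_body(authors, aggs):
    """Step 2: match every comment by those authors and attach new aggregations."""
    return {
        "size": 0,
        "query": {"terms": {"author.keyword": authors}},
        "aggs": aggs,
    }


# Made-up response in the shape a terms aggregation returns (not real data).
sample_response = {
    "aggregations": {
        "authors": {
            "buckets": [
                {"key": "bot_a", "doc_count": 42},
                {"key": "bot_b", "doc_count": 7},
            ]
        }
    }
}

step1 = fast_reply_authors_body(1)
authors = author_keys(sample_response)
step2 = all_comments_body(
    authors,
    {"avg_reply_delay": {"avg": {"field": "reply_delay"}}},  # hypothetical follow-up agg
)
print(json.dumps(step1))
print(json.dumps(step2))
```

Each printed body would be sent to localhost:9200/rc/_search like the curl command above. This costs two round trips, and a very long author list in the terms query may run into the index's limit on terms per query, which is part of why I am asking whether it can be done inside Elasticsearch itself.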
Is this a job for a pipeline aggregation? Is this even possible?