My colleagues and I are trying to figure out the best way to manage tags on a large number of documents.
Our scenario: many devices send data to Elasticsearch, and users can set different tags on each device through a web application to organize their work.
We would like to avoid updating hundreds of thousands of documents every time a user adds or removes a tag. We are thinking of creating a single parent document per device that holds the variable data, such as tags, and linking it to the device's documents with a "join" field datatype at indexing time.
This would let us update a single document per device every time a change happens.
Could this be a good solution? Any suggestions or tips would be great!
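To make the idea concrete, here is a rough sketch of the request bodies I have in mind, written as plain Python dicts so it runs without a cluster. The join field name `relation`, the relation names `device` and `reading`, and the `tags` keyword field are placeholders I made up for illustration, not our real schema.

```python
import json

# Hypothetical field name for the join field (an assumption, not our real mapping).
JOIN_FIELD = "relation"


def build_mapping():
    """Index mapping with a join field: one 'device' parent, many 'reading' children."""
    return {
        "mappings": {
            "properties": {
                JOIN_FIELD: {"type": "join", "relations": {"device": "reading"}},
                "tags": {"type": "keyword"},
            }
        }
    }


def parent_doc(tags):
    """Parent document: one per device, holds the data that changes (tags)."""
    return {JOIN_FIELD: {"name": "device"}, "tags": list(tags)}


def child_doc(device_id, payload):
    """Child document: the device data, pointing at its parent by id."""
    doc = dict(payload)
    doc[JOIN_FIELD] = {"name": "reading", "parent": device_id}
    return doc


def child_index_params(device_id):
    """Children must be routed with the parent id so they land on the parent's shard."""
    return {"routing": device_id}


def readings_with_tag(tag):
    """Search body: children whose parent device carries the given tag."""
    return {
        "query": {
            "has_parent": {
                "parent_type": "device",
                "query": {"term": {"tags": tag}},
            }
        }
    }


if __name__ == "__main__":
    print(json.dumps(build_mapping(), indent=2))
    print(json.dumps(parent_doc(["lab", "floor-2"])))
    print(json.dumps(child_doc("device-1", {"temperature": 21.5})))
    print(json.dumps(readings_with_tag("lab")))
```

With this layout, changing a tag means reindexing only the parent document for that device; the children are untouched and are found through the `has_parent` query.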
After some research I found another possible drawback of join: the parent and child must belong to the same shard, so if I roll over my index with an ILM policy I'll have to create the parent document again, right?
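If that reading is right, each rollover would need something like the sketch below: build a bulk request body that writes every device's parent document into the new backing index before children arrive there. The index name `device-data-000002`, the `relation` join field, and the in-memory `parents` dict are assumptions for illustration; in practice the parents would have to be read from wherever the tags are kept.

```python
import json


def parent_copy_bulk_lines(new_index, parents):
    """Build an NDJSON bulk body that re-creates parent docs in a new index.

    `parents` maps device id -> parent document source (hypothetical input).
    Each parent is routed by its own id, matching the routing its children use.
    """
    lines = []
    for device_id, source in parents.items():
        action = {"index": {"_index": new_index, "_id": device_id, "routing": device_id}}
        lines.append(json.dumps(action))
        lines.append(json.dumps(source))
    # The bulk API expects the body to end with a newline.
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    parents = {
        "device-1": {"relation": {"name": "device"}, "tags": ["lab"]},
        "device-2": {"relation": {"name": "device"}, "tags": ["field", "north"]},
    }
    print(parent_copy_bulk_lines("device-data-000002", parents), end="")
```

The cost is that a tag change then has to be applied to the parent copy in every index that still holds children for that device, which is part of what I am unsure about.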