So I am trying to save comment objects to ES, should I just use comment_id as the native _id field? Later if I am to retrieve 500 comments, will query by ids
{
"query": {
"ids" : {
"values" : [id1, id2, ...]
}
}
}
, faster than terms: {id: [id1,id2,]}
@cynosureabu
ES guarantees uniqueness of _id in the index. If values are stored as id field you can store duplicates. Two options are not technically same.
While getting a single document by _id localhost:9200/<index>/<_id>, ES can determine the shard this id belongs to. It will be optimal compared to running a query which will be sent to all shards
If you are searching multiple ids, _id and id field should perform same.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.