I have an index running on 6.3. It is about 11MM documents, and since the documents are nested, they are much larger in count. These documents represent information stored in multiple databases relating to a specific id (a long int). Since it is unique, that is provided as the _id for the document as well.
I am looking at ways to tune the indexing for this. One thing that caught my eye when reading the guide was to use autogenerated iDs.
Here is my concern.
- Would unique IDs such as what I have provided be enough or does the autogenerated ID have provide something much better?
- If I were to use autogenerated IDs, how will I update the document using the Update API? I don't have the ID anymore (since that's autogenerated). And I wasn't intending to store the autogenerated IDs in any other database / datastore.
- Thinking aloud on the second point, should I be making a retrieval call on Elasticsearch just to check for the autogenerated ID and "upsert" accordingly?