we upgraded to ES 5 and we are trying to check the disk usage of our snapshots by index. The new scheme of using index UUIDs as folder names makes it impossible for us to know what folder corresponds to what index.
For indices that are still on ES we can get an index's UUID with the _cat/indices API but for indices that have been deleted on the cluster we cannot find that info and the snapshot API does not provide it.
I'm not sure whether we should expose the location of the actual files in our APIs or whether that should remain an implementation detail. If you think that this is an important feature to have, please open an issue on Elasticsearch's Github repository and explicitly mark it as a feature request.
As I said above my goal is not really to find out the paths but to see how much disk space each index takes up on the repository. I was able to do that back on 2.0 when paths where named after indices but not anymore.
You think I should open a feature request for getting the snapshoted index sizes on the API? Maybe that's available already somewhere else and I missed it?
I could not find any feature request like that. Maybe it makes sense to formulate the feature request more generally, for example to have an index - level view on snapshots instead of a snapshot id - based view, i.e., have an API where you can ask what the snapshots are that contain index XYZ. The API could then return size of files referenced by each snapshot and also total size.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.