We would like to seek your recommendation on log data retention, as we are implementing the Elastic Stack on-premise. To align with regulatory requirements, we must store the data on NAS storage using the Hot, Warm, Cold, Delete, and Snapshot features.
Given this requirement, could you please advise which external storage solution is recommended for our RedHat Linux environment—NFS share, SMB, or iSCSI?
I believe the cold storage tier relies on searchable snapshots, which is a feature that require a commercial license. If you have this or plan to get one I would recommend you reach out to Elastic and see if a solution architect can help with sizing and capacity planning.
Without searchable snapshots you are otherwise looking at a hot-warm architecture. This does , as outlined in this old blog post, rely on having nodes with different hardware profiles in the different tiers. The hot tier usually requires fast local SSD storage as it can be very I/O intensive, but this will to some extent depend on the use case’s ingest volumes, query performance, and retention requirements which you have not at all provcided any detail on.
If we disregard any performance implications of using networked storage it is important that any storage used with data nodes behave and provide exactly the same durability guarantees as local storage, so I would suspect iSCSI might be the best option.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.