Best ways to prevent data loss in Elasticsearch


(hoey) #1

We have decided to use Elasticsearch as our data store and to handle our different search scenarios with it.
The data is critical and must never be lost; we need to retain it for 10 days.
Because the data is time series, we want to create an index per hour, so that different queries are routed to different indices. The other reason we chose hourly indices is that it makes it easy to remove older data (after 10 days).
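The hourly-index scheme described above can be sketched in a few lines of Python. The `events-` prefix and the exact name format are assumptions for illustration, not anything Elasticsearch mandates:

```python
from datetime import datetime, timedelta, timezone

RETENTION_DAYS = 10  # keep 10 days of data, as described above

def hourly_index(ts: datetime) -> str:
    """Name of the hourly index a document with timestamp `ts` belongs to."""
    return ts.strftime("events-%Y.%m.%d.%H")  # 'events-' prefix is an assumption

def expired_indices(now: datetime, existing: list[str]) -> list[str]:
    """Indices older than the retention window, i.e. safe to delete."""
    cutoff_name = hourly_index(now - timedelta(days=RETENTION_DAYS))
    # lexicographic order matches chronological order for this name pattern
    return [name for name in existing if name < cutoff_name]
```

Deleting a whole expired index is a single `DELETE <index>` call, which is far cheaper than deleting individual old documents out of one big index.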
Because we chose Elasticsearch as the primary data store, we are concerned about losing our data.
Does Elasticsearch have a solution for handling data loss?
Is snapshot and restore suitable for that?
Or should we have a primary storage and use Elasticsearch as a secondary data store?


(David Pilato) #2

Read https://www.elastic.co/guide/en/elasticsearch/resiliency/current/index.html

If you absolutely need a 100% guarantee, I'd use Logstash to send the data both to Elasticsearch and to another datastore (HDFS, FS, Oracle, PostgreSQL, ...).

That way you will have the data in both places in case of any hard crash.
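A minimal sketch of that dual-output idea as a Logstash pipeline, assuming a Beats input and a local file as the second store (paths, ports, and index names are placeholders):

```
input {
  beats { port => 5044 }
}

output {
  # searchable copy in Elasticsearch, routed to an hourly index
  elasticsearch {
    hosts => ["http://localhost:9200"]
    index => "events-%{+YYYY.MM.dd.HH}"
  }
  # safeguard copy on the filesystem (swap for HDFS, a database, ...)
  file {
    path => "/var/backup/events-%{+YYYY.MM.dd.HH}.log"
  }
}
```

Logstash sends every event to all outputs, so a crash of one store still leaves the data in the other.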


(hoey) #3

1- I have decided to create hourly indices and take a snapshot of each of them.
So, to handle data loss for already indexed data, I think I can restore a corrupted index from its snapshot.
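For reference, the snapshot-per-index plan maps onto the snapshot and restore API roughly like this (repository name, path, and index name are placeholders; the `fs` repository path must be listed in `path.repo` on every node):

```
# register a shared-filesystem snapshot repository
PUT _snapshot/hourly_backup
{
  "type": "fs",
  "settings": { "location": "/mnt/es_backups" }
}

# snapshot one hourly index
PUT _snapshot/hourly_backup/events-2024.05.20.12?wait_for_completion=true
{
  "indices": "events-2024.05.20.12"
}

# restore it after deleting or closing the corrupted copy
POST _snapshot/hourly_backup/events-2024.05.20.12/_restore
{
  "indices": "events-2024.05.20.12"
}
```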

2- On the other hand, to handle data loss at insertion time, I'm thinking about using Logstash with Kafka (Logstash consumes the data from Kafka).

Do these solutions overcome data loss?


(David Pilato) #4
  1. Yes, most likely.
  2. Yes, that could be a good safeguard.

You can always consume the data from Kafka with Logstash and ask Logstash to store the raw data in whatever datastore you want (S3, HDFS, ...) and index the enriched data into Elasticsearch.
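A sketch of that Kafka-fed pipeline, with placeholder broker, topic, bucket, and index names. Note that Logstash filters run before all outputs, so to keep a truly raw copy alongside an enriched one you would either preserve the original message in a field or split the work across two pipelines:

```
input {
  kafka {
    bootstrap_servers => "kafka:9092"
    topics => ["events"]
  }
}

output {
  # durable copy in S3
  s3 {
    bucket => "raw-events"
    region => "us-east-1"
  }
  # searchable copy in Elasticsearch, routed to an hourly index
  elasticsearch {
    hosts => ["http://localhost:9200"]
    index => "events-%{+YYYY.MM.dd.HH}"
  }
}
```

With Kafka in front, events that arrive while Elasticsearch is down stay in the topic and are consumed once it recovers, which addresses the insertion-time loss in point 2.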


(system) #5

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.