I need to store a maximum of 200 billion records, each having two fields, A and B, both strings of no more than 255 characters.
I'll be inserting records into this database at a rate of about 50,000 per second.
About once a second, I'll also be querying the database. All the queries will be the same: I'll need all the records where the field A=X, for a given X string.
Is it possible to use Elasticsearch to store such a database?
Is this the only way you're going to use the database? Asking the whole dataset the same question every second isn't efficient, since you only need to consider the data that has been inserted since the last query. I'd consider pushing the records to a broker and having a program process each new record, keeping track of the ones with A=X. If events expire, you'd have to deal with that in some way too.
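A minimal sketch of that broker approach, assuming Kafka with JSON messages carrying the A and B fields; the topic name, bootstrap server, and target value X here are placeholders, not from the original question:

```python
import json
from collections import defaultdict

from kafka import KafkaConsumer  # pip install kafka-python

TARGET_A = "X"  # the value you keep querying for (placeholder)

consumer = KafkaConsumer(
    "records",                          # hypothetical topic name
    bootstrap_servers="localhost:9092",
    value_deserializer=lambda raw: json.loads(raw),
)

matches = []                # keep the matching records themselves...
counts = defaultdict(int)   # ...or just per-value counters, if that's enough

for message in consumer:
    record = message.value
    counts[record["A"]] += 1
    if record["A"] == TARGET_A:
        matches.append(record)
        # act on the new match here instead of re-querying the whole dataset
```

The point is that each record is examined exactly once as it arrives, rather than scanning 200 billion stored records every second.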
Yes, ES should be fine with this... if you give it enough/appropriate hardware, of course. These are super small/narrow docs and the cheapest possible queries. Just don't go updating a large number of docs at once, and try to avoid shard reallocation.
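A minimal sketch of what "narrow docs and the cheapest possible queries" could look like, assuming the official Python client (elasticsearch-py 8.x); the index name, host, and sample data are placeholders:

```python
from elasticsearch import Elasticsearch, helpers

es = Elasticsearch("http://localhost:9200")

# Two exact-match strings of <= 255 chars: map both as keyword.
# B is never searched on, so it doesn't need to be indexed at all.
es.indices.create(
    index="records",
    mappings={
        "properties": {
            "A": {"type": "keyword"},
            "B": {"type": "keyword", "index": False},
        }
    },
)

# Bulk-index incoming records; batching is essential at ~50k docs/s.
batch = [("X", "some value"), ("Y", "other value")]  # placeholder data
helpers.bulk(
    es,
    ({"_index": "records", "_source": {"A": a, "B": b}} for a, b in batch),
)

# The cheapest query for "all records where A == X": a term query on a
# keyword field. Use search_after / point-in-time to page past 10k hits.
hits = es.search(index="records", query={"term": {"A": "X"}}, size=10_000)
```

Whether a single term query over 200 billion docs stays fast once a second depends heavily on sharding and hardware, which is exactly the caveat above.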