Elasticsearch vs MySQL for large data set using exact value searching

ian410 · June 12, 2018, 5:55pm

I'm evaluating Elasticsearch against sharded (or partitioned) MySQL for a data set of around 2.5 billion frequently accessed records. The data is simple, in the format (customer_id, product_id, qty, price, and a few other miscellaneous columns). The miscellaneous columns will never be searched against; they are just data we need.

The queries will always be in the form WHERE customer_id = ? AND product_id IN (?) AND qty = ?, with perhaps an occasional WHERE customer_id = ? or WHERE product_id = ?. For MySQL, these are easily indexed. I've already done some testing: on first load, a query takes around 0.8 seconds against around 1.1 billion rows, and the second load is in the 0.05-second range. In the final solution, this would be sharded over multiple DBs or partitioned over multiple tables, so I would expect faster results in either case.
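To make the query shape concrete, here is a minimal sketch of the indexes and the main lookup. It uses Python's built-in sqlite3 as a stand-in for MySQL so it runs anywhere; the table name `customer_products`, the index names, and the sample rows are made up for illustration, and nothing here reflects the actual schema or timings.

```python
import sqlite3

# In-memory SQLite as a stand-in for MySQL (assumption: same index idea applies).
conn = sqlite3.connect(":memory:")
conn.execute(
    """
    CREATE TABLE customer_products (
        customer_id INTEGER NOT NULL,
        product_id  INTEGER NOT NULL,
        qty         INTEGER NOT NULL,
        price       REAL    NOT NULL
    )
    """
)
# Composite index for the main query; its leading column also serves
# the occasional WHERE customer_id = ? lookup.
conn.execute(
    "CREATE INDEX idx_cust_prod_qty "
    "ON customer_products (customer_id, product_id, qty)"
)
# Separate index for the occasional WHERE product_id = ? lookup.
conn.execute("CREATE INDEX idx_prod ON customer_products (product_id)")

# Hypothetical sample rows.
rows = [(1, 10, 5, 9.99), (1, 11, 5, 4.50), (1, 12, 3, 7.25), (2, 10, 5, 9.99)]
conn.executemany("INSERT INTO customer_products VALUES (?, ?, ?, ?)", rows)


def find(conn, customer_id, product_ids, qty):
    """WHERE customer_id = ? AND product_id IN (...) AND qty = ?"""
    # One placeholder per product id, so the IN list stays parameterized.
    placeholders = ",".join("?" for _ in product_ids)
    sql = (
        "SELECT customer_id, product_id, qty, price FROM customer_products "
        f"WHERE customer_id = ? AND product_id IN ({placeholders}) AND qty = ? "
        "ORDER BY product_id"
    )
    return conn.execute(sql, [customer_id, *product_ids, qty]).fetchall()


result = find(conn, 1, [10, 11, 12], 5)
print(result)
```

The IN (?) in the query above has to expand to one placeholder per product id, which is why the helper builds the placeholder list dynamically.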

Would Elasticsearch perform better on the first pass for this amount of data, given the type of searching we are doing? We have a large budget, so we can cluster, shard, or get a big server for partitioning extensively in either solution.
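For comparison, the same exact-value lookup on the Elasticsearch side would presumably be a bool query with filter clauses, since no relevance scoring is needed. The sketch below only builds the request body as a Python dict and does not talk to a cluster; the field names mirror the columns above, and it assumes they are mapped as numeric or keyword fields.

```python
import json


def build_query(customer_id, product_ids, qty):
    """Build an Elasticsearch query body for exact-value matching.

    Assumption: customer_id, product_id and qty exist as fields with these
    names and are mapped as numeric or keyword types.
    """
    return {
        "query": {
            "bool": {
                # filter context: exact matches only, no scoring
                "filter": [
                    {"term": {"customer_id": customer_id}},
                    {"terms": {"product_id": list(product_ids)}},
                    {"term": {"qty": qty}},
                ]
            }
        }
    }


body = build_query(1, [10, 11, 12], 5)
print(json.dumps(body, indent=2))
```

The `term` clause covers the = comparisons and `terms` covers the IN list.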

Thank you.

system · July 10, 2018, 5:55pm

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
