I'm trying to figure out our basic scale unit based on this article: https://www.elastic.co/guide/en/elasticsearch/guide/current/scale.html
Hardware is fix, and I use one primary shard without any replica.
I measure the average response time with JMeter and my goal is to keep it around 500 ms for 50 parallel users.
- for 20.000 documents response time is 500 ms
- for 100.000 documents response time is around 1 sec
- for 7 million documents response time is 1.5 sec
This is not a linear scaling and 20.000 documents seems a very low amount for my needs.
Could you please help me, what can be the problem? How can I figure out my basic scale unit?