Is it true that sometimes elasticsearch aggregations can be inaccurate?

It has to do with scale, not load. Some type of aggregations are approximates as calculating exact values fast at large scale is not possible. Examples are the cardinality aggregation, significant terms aggregations and also the terms aggregation (in cases of high cardinality).

Have a look at the following threads that discuss this: