Good practice for distributing data across indexes

Hello everyone,

Here is a brief description of the situation.

We have approximately 5,000 customers who use a tool connected to a Postgres database. Each customer manipulates data in a database of their own. Each database contains approximately 30 tables, and the schema of every database is identical. So we have this:

db_customer_1

  • table_1
  • table_2
  • ...
  • table_30

...

db_customer_5000

  • table_1
  • ...

We would like to index all this data in Elasticsearch, but we cannot decide how to structure our indexes.

First proposal:
We put all the data from table_1 of every database into one index called index_table_1, with an identifier on each document that tells us which customer it belongs to. That gives about 30 indexes.
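For illustration, here is a minimal sketch of what the mapping for such a shared index could look like, assuming a recent Elasticsearch version; customer_id is the field from our model, the other fields are just placeholders:

```
PUT index_table_1
{
  "mappings": {
    "properties": {
      // the field identifying the customer, kept as keyword for exact matching
      "customer_id": { "type": "keyword" },
      // placeholder fields, only here to show the shape of the mapping
      "created_at": { "type": "date" },
      "label": { "type": "text" }
    }
  }
}
```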

Second solution:
We put the data of each table for each customer into its own index, named for example index_customer_1_table_1. That makes approximately 150,000 indexes (5,000 customers × 30 tables).

We are afraid that the first solution will cause performance issues because, even though the data follows the same model, it does not all belong to the same customer.

With the second solution, we are afraid that the growing number of indexes could become a problem in the medium term.

Can you help us?

Thank you

The number of indices in option 2 will be an immediate problem, so I would recommend option 1.
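As a side note, if you still want a per-customer view on top of option 1, a filtered alias per customer is a common pattern; a rough sketch, where the alias name and filter value are only illustrative:

```
POST _aliases
{
  "actions": [
    {
      "add": {
        "index": "index_table_1",
        "alias": "customer_1_table_1",
        // each customer queries its alias and only sees its own documents
        "filter": { "term": { "customer_id": "1" } }
      }
    }
  ]
}
```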

Thank you for your reply.
Is there a specific mapping, or something else, that I can put on the customer_id field of my documents, like an index in a DBMS, so that my aggregations are as efficient as possible? They will always start with match: {customer_id: ****, ...}.
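The usual pattern is to map customer_id as a keyword and restrict on it with a term query inside a bool filter rather than a match query: filter clauses skip scoring and can be cached across requests. A minimal sketch; the customer value and the status field in the aggregation are illustrative placeholders:

```
GET index_table_1/_search
{
  "size": 0,
  "query": {
    "bool": {
      // non-scoring, cacheable clause that narrows to one customer
      "filter": [
        { "term": { "customer_id": "1234" } }
      ]
    }
  },
  "aggs": {
    // example aggregation over the filtered documents
    "by_status": {
      "terms": { "field": "status" }
    }
  }
}
```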

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.