Effective separation of tenant data in latest release of ElasticSearch

Rajesh_Kishore · November 14, 2018, 3:18am

Hi All,

We want to use ElasticSearch as a multi-tenant store , each tenant would have different requirement for document type/schema.

What is the best way to store data wrt cost, manageability in this regard ?

1> Each tenant having separate index with varying document types may not be efficient?

2> A set of tenants may fall into one index with varying document types
but With ElasticSearch's removal of mapping types mentioned in link
It seems to be possible only through have custom type as mentioned in the link.

Please advise what is the best possible way to seperate tenant's data with each tenant having separate schema/document type requirement?

Thanks,
Rajesh

warkolm · November 14, 2018, 4:30am

That custom type is literally just a field and value, there's nothing special about it.

Rajesh_Kishore · November 14, 2018, 5:19am

so could you pls advise what is the best strategy?

warkolm · November 14, 2018, 5:37am

If you want to separate by customer then you will probably need to separate out documents that are not similar, perhaps you will need multiple indices per customer.
If you want to group by document similarity then that would be ok, you just need to manage multi-tenancy with something like Security.

The best solution is one that works for you out of those, they both have pros and cons.

Rajesh_Kishore · November 14, 2018, 6:53am

But multiple indices per customer , wont affect performance ? and we wont have similar document type per tenant / or across tenant

Christian_Dahlqvist · November 14, 2018, 6:54am

How many tenants are you expecting? How much control do you have over the data?

Rajesh_Kishore · November 14, 2018, 6:56am

There can be many because initially we will have lot of free customers. Its not possible as of now to quantify how much as this is the cloud service we are building

Christian_Dahlqvist · November 14, 2018, 6:59am

As mappings have to be consistent per index, you will need to impose some control on the content and mappings if you want tenants to shard indices. This is usually necessary as having an index per tenant scales badly. Having lots of small indices will result in performance problems.

There are no easy solutions, but I have seen users place controls on the data and have small users share indices and let a smaller number of larger users have their own.

I have seen users try going with one index per tenant and then deploy this across a lot of small clusters. This reduces the size of the cluster state per cluster but also does not necessarily scale well.

Rajesh_Kishore · November 14, 2018, 7:02am

Got the idea to some extent, let me put more research on this , I will come back to this. In the meantime, more suggestions are highly appreciated.

Christian_Dahlqvist · November 14, 2018, 7:02am

This has been asked before, so you may find additional points if you search the forum.

system · December 12, 2018, 7:02am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Multi-tenancy best practices Elasticsearch	1	465	July 6, 2017
[ES 2.3] Modeling a heterogenous multi-tenant document store Elasticsearch	2	528	February 2, 2017
In a multi tenant application, is it advisable to use types to separate each tenant data Elasticsearch	2	496	April 2, 2017
Elasticsearch Multitenancy Support Elasticsearch	4	1008	July 5, 2017
How to implement multi tenant environment in Elasticsearch Elasticsearch	19	6104	September 28, 2023

Effective separation of tenant data in latest release of ElasticSearch

Related topics