Create-or-replace index programmatically

mmu · May 26, 2020, 1:15pm

Hi,

I have a Java web service that uses an ES index for caching. Sometimes the field/index definitions change, so the ES index must be re-created.

What I am looking for is a solution pattern (or API function) that:

Creates index and mapping if it is missing
Deletes and re-creates index and mapping if the index definition is not in sync with an expected definition

dadoonet · May 26, 2020, 4:26pm

Creates index and mapping if it is missing

I wrote a project which does something like this:

You can see it in this demo project. Index settings.

Deletes and re-creates index and mapping if the index definition is not in sync with an expected definition

That's super dangerous because it will delete all the existing data.
But there is an option in Beyonder here:

github.com

dadoonet/elasticsearch-beyonder/blob/main/src/main/java/fr/pilato/elasticsearch/tools/ElasticsearchBeyonder.java#L103


      
    public static void start(RestClient client, String root) throws Exception {
        start(client, root, Defaults.ForceCreation);
    }

    /**
     * Automatically scan classpath and create indices, mappings, templates, and other settings.
     * @param client elasticsearch client
     * @param root dir within the classpath
     * @param force whether or not to force creation of indices and templates
     * @throws Exception when beyonder can not start
     */
    public static void start(RestClient client, String root, boolean force) throws Exception {
        logger.info("starting automatic settings/mappings discovery");

        // create index lifecycles
        List<String> indexLifecycles = ResourceList.getResourceNames(root, Defaults.IndexLifecyclesDir);
        for (String indexLifecycleName : indexLifecycles) {
            createIndexLifecycle(client, root, indexLifecycleName);
        }

        // create component templates

Using force will remove the index if it already exists. Note that it will not compare the existing schema with the one that you want to apply. That could be a good option to add to this project though. But it still looks super dangerous to me, and I'd only use it for integration testing, not for production code.

Note that the merge option can try to update an existing mapping, but it will work only in a few circumstances (like adding a new field).
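
To illustrate the difference between a change that can be merged and one that needs a full re-creation, here is a minimal sketch. It is not part of Beyonder or the Elasticsearch client: `MappingChangeCheck`, `Action` and `decide` are hypothetical names, and the mappings are assumed to be already flattened into field name to type maps. It treats a new field as mergeable and a changed or removed field as requiring re-creation, since an existing field's type cannot be changed in place.

```java
import java.util.Map;

/** Hypothetical helper; not part of Beyonder or any Elasticsearch client. */
final class MappingChangeCheck {

    enum Action { NONE, MERGE, RECREATE }

    /**
     * Compares two flattened mappings (field name -> field type) and decides
     * what would be needed to bring the existing index in line with the expected one.
     */
    static Action decide(Map<String, String> existing, Map<String, String> expected) {
        boolean added = false;
        for (Map.Entry<String, String> field : expected.entrySet()) {
            String currentType = existing.get(field.getKey());
            if (currentType == null) {
                added = true;              // new field: a mapping update can add it
            } else if (!currentType.equals(field.getValue())) {
                return Action.RECREATE;    // type of an existing field changed
            }
        }
        // Assumption: a field that disappeared from the expected definition
        // also counts as "out of sync" and triggers a re-creation.
        if (!expected.keySet().containsAll(existing.keySet())) {
            return Action.RECREATE;
        }
        return added ? Action.MERGE : Action.NONE;
    }
}
```

For example, going from `{name: keyword}` to `{name: keyword, age: integer}` yields `MERGE`, while changing `name` from `keyword` to `text` yields `RECREATE`.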

mmu · May 26, 2020, 9:40pm

Thanks for the hint! - But I am really looking for something robust that detects index definition changes and performs a complete re-creation if definitions have changed. Data loss is not an issue - ES is not the primary data store and can be rebuilt within minutes. It is just not convenient to force that kind of re-creation on every service restart.

dadoonet · May 27, 2020, 3:53am

You need to write this logic yourself then.
I'm not sure whether Spring Data Elasticsearch or Hibernate Search supports that, but it is something you might want to check.

If you do your own implementation, I'd be happy to welcome it as a PR in the beyonder project.

mmu · May 29, 2020, 7:13am

I see three options for persisting information about the last "valid" index definition:

1. Hash the original index definition and store the hash in some metadata field in ES (I am not sure there is an appropriate field to do this - is there?)
2. Hash the original index definition and store the hash in a persistent location (usually the file system)
3. Keep the original index definition and store it in a persistent location (also the file system)

On startup, the current _settings.json could be checked against the persisted hash/definition, and re-creation forced if they don't match.

My preference would be (1) but I could not yet find an appropriate customizable metadata field at the index level. Options (2,3) require configuration of some persistent file location, which would work for me but is probably less desirable for this kind of library.
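
The hashing step shared by options (1) and (2) could look roughly like the sketch below, which uses only the JDK. `DefinitionHash`, `sha256Hex` and `needsRecreate` are hypothetical names, and the choice of SHA-256 over the raw text of the definition file is an assumption, not something the thread prescribes.

```java
import java.nio.charset.StandardCharsets;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

/** Hypothetical helper for fingerprinting an index definition. */
final class DefinitionHash {

    /** Returns the lowercase hex SHA-256 digest of the definition text. */
    static String sha256Hex(String definitionJson) {
        try {
            MessageDigest digest = MessageDigest.getInstance("SHA-256");
            byte[] bytes = digest.digest(definitionJson.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder(bytes.length * 2);
            for (byte b : bytes) {
                hex.append(String.format("%02x", b & 0xff));
            }
            return hex.toString();
        } catch (NoSuchAlgorithmException e) {
            // SHA-256 is one of the algorithms every Java platform must provide.
            throw new IllegalStateException("SHA-256 not available", e);
        }
    }

    /**
     * True when no hash was persisted yet or the persisted hash differs
     * from the hash of the definition the service expects now.
     */
    static boolean needsRecreate(String storedHash, String expectedDefinitionJson) {
        return storedHash == null || !storedHash.equals(sha256Hex(expectedDefinitionJson));
    }
}
```

Because the raw text is hashed, a pure whitespace or key-order change in the definition file would also trigger a re-creation. That seems acceptable when the index can be rebuilt within minutes; otherwise the JSON would need to be normalized before hashing.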

dadoonet · May 29, 2020, 8:01am

Good idea. I'd use https://www.elastic.co/guide/en/elasticsearch/reference/current/mapping-meta-field.html
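
A rough sketch of how the hash could travel in the mapping's `_meta` field follows. It only builds and reads JSON strings, so the HTTP calls are left out: the body would be sent with `PUT /<index>`, and the stored value read back from `GET /<index>/_mapping`. `MetaHashStore`, the `definition_hash` key and the regex-based extraction are assumptions made for illustration; a real implementation would more likely use a JSON parser.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical helper that keeps a definition hash in the mapping's _meta field. */
final class MetaHashStore {

    // Assumed key name; _meta accepts arbitrary application-specific content.
    static final String META_KEY = "definition_hash";

    private static final Pattern STORED =
            Pattern.compile("\"" + META_KEY + "\"\\s*:\\s*\"([0-9a-f]+)\"");

    /**
     * Builds a create-index request body that stores the hash under mappings._meta.
     * settingsJson and propertiesJson must each be a JSON object; hash must be hex.
     */
    static String createIndexBody(String settingsJson, String propertiesJson, String hash) {
        return "{\"settings\":" + settingsJson
                + ",\"mappings\":{\"_meta\":{\"" + META_KEY + "\":\"" + hash + "\"}"
                + ",\"properties\":" + propertiesJson + "}}";
    }

    /** Extracts the stored hash from a get-mapping response, or null if none is present. */
    static String storedHash(String mappingResponseJson) {
        Matcher matcher = STORED.matcher(mappingResponseJson);
        return matcher.find() ? matcher.group(1) : null;
    }
}
```

On startup the service would read the stored hash, compare it with the hash of the expected definition, and delete and re-create the index only when the value is missing or different.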

system · June 26, 2020, 8:01am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
