Creating a mapping using Hadoop configuration


#1

Hi,

I want to put a mapping in an index using elasticsearch-hadoop. Is there a way to create the mapping through the configuration? I've only found information on creating a mapping with the Java API, using a node and client.

Thank you


(Costin Leau) #2

The connector can create a mapping based on your schema definition where that is supported (Pig, Hive, Spark).
Outside of those, the mapping has to be created externally, for several reasons:

  • ease of use: corner cases such as merging the mapping, deleting the previous one, etc. are better handled externally
  • there are no guarantees in the connector: since the job is split into multiple tasks, neither Map/Reduce nor Spark guarantees which tasks execute first, so the tasks cannot properly coordinate creating the mapping. One task might try to add the mapping while the other tasks perceive it as an override.
    Additionally, there's no clear lifecycle between when a task hits the cluster versus configuration time.
    That is, across the Map/Reduce, Hive, Pig, and Cascading APIs we don't know for sure whether the task has been validated and can actually start, or whether the OutputFormat is merely being instantiated in the chain.
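For the supported cases mentioned above (Pig, Hive, Spark), the connector's schema-based index creation is typically enabled through its configuration. A minimal sketch using elasticsearch-hadoop's actual settings; the node address and the index/type names in `es.resource` are placeholders to adapt to your setup:

```properties
# elasticsearch-hadoop connector settings (sketch; values are placeholders)
es.nodes = localhost
es.port = 9200
# target index/type the job writes to
es.resource = my-index/my-doc
# let the connector create the index, deriving the mapping
# from the job's schema (only where schemas exist: Pig, Hive, Spark)
es.index.auto.create = true
```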

In other words, it's a lot easier and safer to create the mapping externally, all the more so since it's a one-time task, as opposed to a job, which is typically run several times across different data sets.
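As a concrete example of the external approach, the mapping can be created once with a single HTTP PUT before the job ever runs. A minimal sketch; the index name `my-index`, type `my-doc`, the field names/types, and the cluster URL are all placeholders for illustration:

```python
import json

# Hypothetical mapping for the documents the job will write.
# Field names and types are placeholders -- adapt them to your schema.
mapping = {
    "mappings": {
        "my-doc": {
            "properties": {
                "title":     {"type": "string"},
                "timestamp": {"type": "date"},
                "views":     {"type": "long"},
            }
        }
    }
}

# Serialize the body to send with the index-creation request, e.g.:
#   curl -XPUT 'http://localhost:9200/my-index' -d @mapping.json
body = json.dumps(mapping, indent=2)
print(body)
```

Since this runs once before any job, there is no coordination problem: every Map/Reduce or Spark task simply writes into an index whose mapping already exists.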


#3

Thank you for your help!

