Elasticsearch on Spark - Index Mapping

eliasah · August 13, 2015, 2:54pm

Hello everyone,

I'm trying to write a Spark job reading a CSV file and writing into Elasticsearch in Java:

I'm defining my a class object with the following instance variables :

I have succeeded writing my data into Elasticsearch but I wish to specify that title shouldn't be analyzed.

How can I do that within the Spark job?

Thanks in advance!

costin · August 13, 2015, 7:38pm

You need to define the mapping before hand in Elasticsearch. There are various reasons why this is like this:

it's a one time thing while a job can run multiple times.
there is no clear life-cycle hook that the connector can use across all integrations to add the mapping. Also things like versioning, merging conflicts, etc... are not easy to resolve and outside the scope of the connector.

eliasah · August 14, 2015, 6:18am

That's what I suspect. Thanks for your answer!

Cheers.

Topic		Replies	Views
(apache spark df).saveToES(elastic search) Elasticsearch es-hadoop	3	2053	March 26, 2017
JavaEsSpark.saveToES not using pre-defined mapping fields while posting the data to ES cluster Elasticsearch es-hadoop	9	2405	April 9, 2017
How to use ElasticSearch hadoop connector to create type automatically from Streaming DataSet<Row>? Elasticsearch es-hadoop	18	2476	March 1, 2018
Elasticsearch and spark Elasticsearch	7	1171	July 6, 2017
Multiple Field as mapping iD Elasticsearch es-hadoop	2	1364	July 19, 2018