Multiple fields as mapping ID

padhu1989 · June 8, 2018, 7:53am

I am pretty new to Elasticsearch. I am using elasticsearch-hadoop version 6.2.4 with Spark Structured Streaming: I read files from HDFS, convert them to bean objects, and write them to Elasticsearch.

    StreamingQuery query = dataSet
            .writeStream()
            .format("org.elasticsearch.spark.sql")
            //.outputMode(OutputMode.Append())
            // Forward slashes: in a Java string literal "\t" is a tab and "\c" does not compile
            .option("checkpointLocation", "/tmp/ckpt1")
            .option("es.nodes", "abc.dev.cm.par.xy.hp")
            .option("es.port", "9200")
            // Field of the bean whose value becomes the document _id
            .option("es.mapping.id", "CustomerID")
            .option("es.resource", "testIndex/testType")
            .start();

While writing, I specify one of the fields in the POJO class (CustomerID) as the mapping ID. Can we use multiple fields, or a combination of fields, as the mapping ID? For example, my file contains both a customer ID and an order ID field. Can we combine the two into something like CustomerID + OrderID?

james.baiera · June 21, 2018, 3:20pm

If you create a new id field during your Spark processing, you can reference that field in the es.mapping.id setting. Granted, this will also write the id field out to Elasticsearch as part of the document, so if you want the field left out of the document after the id is extracted, you can include its name under es.mapping.exclude.
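
As a rough illustration of the approach above, here is a minimal sketch of a bean that carries its own combined id. The class name `CustomerOrder`, the field name `docId`, and the `_` separator are all assumptions made up for this example, not anything from the original post or from elasticsearch-hadoop.

```java
import java.io.Serializable;

// Hypothetical bean for illustration; declare it public in its own file
// if you use it as a Spark bean.
class CustomerOrder implements Serializable {
    private String customerID;
    private String orderID;
    private String docId; // hypothetical combined id field

    public CustomerOrder() {}

    public CustomerOrder(String customerID, String orderID) {
        this.customerID = customerID;
        this.orderID = orderID;
        this.docId = buildDocId(customerID, orderID);
    }

    // Joins the two values with a separator so that ("12", "3") and
    // ("1", "23") do not produce the same id.
    static String buildDocId(String customerID, String orderID) {
        return customerID + "_" + orderID;
    }

    public String getCustomerID() { return customerID; }
    public void setCustomerID(String customerID) { this.customerID = customerID; }

    public String getOrderID() { return orderID; }
    public void setOrderID(String orderID) { this.orderID = orderID; }

    public String getDocId() { return docId; }
    public void setDocId(String docId) { this.docId = docId; }
}
```

With a bean like this, the writer could be pointed at the new field with `.option("es.mapping.id", "docId")`, and, if the combined value should not be stored in the document itself, `.option("es.mapping.exclude", "docId")`. If you work with a `Dataset<Row>` rather than beans, a similar column can likely be added with `withColumn` and `functions.concat_ws` instead.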

