I am new to the Elasticsearch + Apache Spark combination. I have a question: how can I set the datatypes in Elasticsearch for a DataFrame I want to save?
For example, when I save a DataFrame, some columns get saved as the 'text' type, but I would like them saved as the 'keyword' type. Also, can I update the mappings before the DataFrame is written to the index, or can that only be done after a write operation?
@Muthu_Jayakumar ES-Hadoop defers to Elasticsearch to automatically map fields. This means that string data will be automatically mapped to the text type, dates may also end up as text, and so on.
If these mappings are not what you expect, you must create the index and its mappings yourself before sending documents with ES-Hadoop/Spark. If you are working with multiple indices, another simple way to handle this is to use index templates in Elasticsearch.
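A minimal sketch of the "create the index first" approach. The index name `people` and the field names here are hypothetical, and the client/Spark calls are shown commented out since they need a running cluster; the helper only builds the index-creation body with explicit `keyword` mappings:

```python
def build_mapping(keyword_fields, text_fields=()):
    """Build an index-creation body mapping the given fields explicitly,
    so Elasticsearch's dynamic mapping never gets a chance to guess."""
    props = {f: {"type": "keyword"} for f in keyword_fields}
    props.update({f: {"type": "text"} for f in text_fields})
    return {"mappings": {"properties": props}}

mapping = build_mapping(keyword_fields=["country_code"], text_fields=["bio"])
# mapping == {"mappings": {"properties": {"country_code": {"type": "keyword"},
#                                         "bio": {"type": "text"}}}}

# Create the index before any Spark write (requires a cluster):
# from elasticsearch import Elasticsearch
# es = Elasticsearch("http://localhost:9200")
# es.indices.create(index="people", body=mapping)

# Then write the DataFrame with ES-Hadoop. Because "people" already exists,
# its explicit mappings are used instead of automatic mapping:
# df.write.format("org.elasticsearch.spark.sql") \
#     .option("es.resource", "people") \
#     .mode("append") \
#     .save()
```

For the multi-index case, the same `mappings` body can go into an index template (via the `_index_template` API) so every index matching a name pattern picks it up automatically.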