Elasticsearch 2.0 and Spark - TimestampType conversion issue

eliasah · November 9, 2015, 10:43am

I'm trying to write a DataFrame into Elasticsearch 2.0 with the following schema

|-- actionId: long (nullable = true)
|-- userId: long (nullable = true)
|-- saleDate: timestamp (nullable = true)

When the index is created during the job the saleDate fields seems to be converted into a long. Here is a part of the mapping :

"saleDate": { "type": "long" },

Is this behavior expected? If so How would it be possible to write a time-stamp field without declaring the mapping before-hand?

costin · November 9, 2015, 12:47pm

There's no functionality in ES-Hadoop to force a certain type conversion outside the existing type and that is on purpose.
Elasticsearch can do that much more reliably and better than the connector by declaring the mapping (sometimes templates help a lot) apriori.

eliasah · November 9, 2015, 2:51pm

Not even if we give the schema of the DataFrame when we want to write to Elasticsearch?

costin · November 9, 2015, 3:57pm

No. The schema is simply Spark's representation of the data. The mapping in ES, is its own.
The conversion in ES relies on conventions and if needed, is pluggable through ValueReader/ValueWriter.

By the way, have you tried using the es.mapping.date.rich parameter introduced in 2.1?

Lior_Baber · June 29, 2016, 6:09am

it happened to me as well see my Stackoverfollow quetsion

you can see in my code that I did use the es.mapping.date.rich parameter (which suppose to be true by default)
any new update regarding this?

Thanks

Topic		Replies	Views
ElasticSearch Spark Elasticsearch es-hadoop	3	970	July 6, 2017
Spark How to store DateType As Date not as Long Elasticsearch es-hadoop	2	779	March 14, 2017
Write Dataframe to ElasticSearch with time-stamp column Elasticsearch	1	504	July 5, 2017
EsHadoopInvalidRequest: failed to parse timestamp [2016-03-21 11:41:21,204] Elasticsearch es-hadoop	2	1103	July 6, 2017
What's the actual data type for a field defined as "date" with "dateOptionalTime" Elasticsearch es-hadoop	3	1134	March 14, 2017

Elasticsearch 2.0 and Spark - TimestampType conversion issue

Related topics