Hello community,
I was trying to upload a simple CSV file, but unfortunately my mapping is not working. All my fields still come out as "keyword". What am I missing?
import csv
from elasticsearch import Elasticsearch, helpers

def csv_reader(filename, indexname):
    # connect to Elasticsearch; host and port are configurable, these are the defaults
    es = Elasticsearch([{'host': 'localhost', 'port': 9200}])
    # if es is reachable, print "connected", otherwise "Not connected"
    if es.ping():
        print("connected")
    else:
        print("Not connected")
    with open(filename, 'r') as outfile:
        reader = csv.DictReader(outfile)
        Settings = {
            "settings": {
                "numer_of_shards": 1,
                "number_of_replicas": 0
            },
            "mappings": {
                "members": {
                    "dynamic": "strict",
                    "properties": {
                        "_id": {
                            "type": "long"
                        },
                        "name": {
                            "type": "text"
                        },
                        "gefundene_fehler": {
                            "type": "long"
                        },
                        "behobene_fehler": {
                            "type": "long"
                        },
                        "updated_time": {
                            "type": "date"
                        },
                        "time": {
                            "type": "date"
                        },
                        "timestamp": {
                            "type": "date"
                        }
                    }
                }
            }
        }
        # recreate the index if it already exists, then bulk-load the CSV rows
        if es.indices.exists(indexname):
            print("deleting existing index")
            es.indices.delete(index=indexname, ignore=[400, 404])
            print("creating new index1")
            es.indices.create(index=indexname, ignore=400, body=Settings)
            helpers.bulk(es, reader, index=indexname)
        else:
            es.indices.create(index=indexname, ignore=400, body=Settings)
            helpers.bulk(es, reader, index=indexname)
            print("creating new index2")
    print("all lines loaded")
Thank you for your response. But unfortunately it's not working. I just want to upload a CSV file to Elasticsearch. If I try it like this, I get the following errors in PyCharm:
line 399, in bulk
    for ok, item in streaming_bulk(client, actions, *args, **kwargs)
line 320, in streaming_bulk
    for data, (ok, info) in zip(
line 249, in _process_bulk_chunk
    for item in gen:
line 188, in _process_bulk_chunk_success
    raise BulkIndexError("%i document(s) failed to index." % len(errors), errors)
elasticsearch.helpers.errors.BulkIndexError: ('2 document(s) failed to index.', [{'index': {'_index': 'mdfadf', '_type': '_doc', '_id': 'UmY3N4ABaUVkc-44MXpt', 'status': 400, 'error': {'type': 'mapper_parsing_exception', 'reason': "failed to parse field [time] of type [date] in document with id 'UmY3N4ABaUVkc-44MXpt'. Preview of field's value: '2022-01-28 13:03:29'", 'caused_by': {'type': 'illegal_argument_exception', 'reason': 'failed to parse date field [2022-01-28 13:03:29] with format [strict_date_optional_time||epoch_millis]', 'caused_by': {'type': 'date_time_parse_exception', 'reason': 'Failed to parse with all enclosed parsers'}}}, 'data': {'name': 'ProjectR', 'gefunde_fehler': '7', 'behobene_fehler': '9', 'time': '2022-01-28 13:03:29', 'updated_time': '2022-04-17 13:10:01.337150'}}}, {'index': {'_index': 'mdfadf', '_type': '_doc', '_id': 'U2Y3N4ABaUVkc-44MXpt', 'status': 400, 'error': {'type': 'mapper_parsing_exception', 'reason': "failed to parse field [time] of type [date] in document with id 'U2Y3N4ABaUVkc-44MXpt'. Preview of field's value: '2022-02-16 17:00:26'", 'caused_by': {'type': 'illegal_argument_exception', 'reason': 'failed to parse date field [2022-02-16 17:00:26] with format [strict_date_optional_time||epoch_millis]', 'caused_by': {'type': 'date_time_parse_exception', 'reason': 'Failed to parse with all enclosed parsers'}}}, 'data': {'name': 'ProjectB', 'gefunde_fehler': '2', 'behobene_fehler': '8', 'time': '2022-02-16 17:00:26', 'updated_time': '2022-04-17 13:10:01.337150'}}}])
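To inspect each failing document rather than one aggregated BulkIndexError, a minimal sketch using the raise_on_error flag of helpers.bulk (the variable names es, reader, and indexname are reused from the script above):

```python
from elasticsearch import Elasticsearch, helpers

# With raise_on_error=False, bulk() returns (success_count, error_items)
# instead of raising BulkIndexError, so each mapper_parsing_exception
# can be printed and inspected on its own.
success, errors = helpers.bulk(es, reader, index=indexname, raise_on_error=False)
print("indexed:", success)
for item in errors:
    print(item)
```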
For values with fractional seconds like '2022-04-17 13:10:01.337150', use strict_date_optional_time_nanos.
Note that you need to add the 'T' in the values for "updated_time", like this: 2022-04-17T13:10:01.337150, or save them without the nanoseconds.
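For example, a quick way to rewrite the CSV values with the 'T' separator before indexing (a minimal sketch using only Python's standard library; the raw value is taken from the error output above):

```python
from datetime import datetime

# fromisoformat() accepts the space-separated value from the CSV, and
# isoformat() re-serializes it with the 'T' that Elasticsearch expects.
raw = "2022-04-17 13:10:01.337150"
iso = datetime.fromisoformat(raw).isoformat()
print(iso)  # 2022-04-17T13:10:01.337150
```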
If you choose a date without nanoseconds, you can use the same format as the time field.
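Putting that together, a sketch of what the two date mappings could look like (the format strings are assumptions based on the values in the error output, not necessarily the final mapping):

```python
settings = {
    "mappings": {
        "properties": {
            # matches '2022-01-28 13:03:29' (space separator, no fractional seconds)
            "time": {"type": "date", "format": "yyyy-MM-dd HH:mm:ss"},
            # matches '2022-04-17T13:10:01.337150' once the 'T' is added
            "updated_time": {"type": "date", "format": "strict_date_optional_time_nanos"},
        }
    }
}
```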