I feel like 'ive tried just about everything to achieve this to no success, so I won't bother going into great detail about all my tests. So far ive attempted getting my Raw JSON file into elasticsearch using curl, jsonpyes and logstash but none of these methods have worked. I would use the bulk API but this would mean adding headers for every line in a giant raw JSON file which I don't know how to do.
If anyone else has worked with getting large JSON files into elasticsearch i'd love to hear how you achieved this. Seems quite ironic how hard it is to get large JSON files into elasticsearch considering JSON is the supposed preferred input for elasticsearch.
Im unsure what you mean by documents, but the structure of my JSON looks like the attached images. I've noticed that the last 3 values of every field are null but I don't think that is what is causing the error.
the jsonpyes tool analysed the raw JSON as valid but im not sure if its accurate or not either.
That looks like a very, very large document with a lot of fields. How many fields are there in the document? What does all the numeric keys shown in the screenshot represent?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.