Unable to parse CSV over 20,000 rows with Data Visualizer

Thodoris_Alexopoulos · June 8, 2020, 5:07pm

Hello everyone,

I am trying to upload a CSV file with 22,000 rows to the Data Visualizer, and it fails with the error: "File could not be read. Unexpected token < in JSON at position 0". When I remove 3,000 rows, the file is accepted without a problem. I have tried several files and the pattern is the same. None of the files exceeds 3 MB.

Do I have to change any parameter in my kibana.yml?

James_Gowdy · June 9, 2020, 12:41pm

Hi Thodoris,
I imagine the < character is coming from an HTML-based error response being returned by the server rather than the expected JSON.
Which version of Elasticsearch are you using?
Can you supply a sample of the data? Just a few typical lines will be fine.

Regards,
James
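
To illustrate the point above about an HTML error page arriving where JSON is expected, here is a minimal Python sketch. It is not Kibana's code; the function name `describe_response_body` and the sample bodies are hypothetical. It only shows why a body starting with `<` fails JSON parsing at the very first character.

```python
import json


def describe_response_body(body: str) -> str:
    """Classify a response body as 'json', 'html', or 'other' (hypothetical helper)."""
    try:
        json.loads(body)
        return "json"
    except json.JSONDecodeError:
        # An HTML error page starts with '<', which can never begin a
        # JSON document, so the parser gives up at position 0.
        if body.lstrip().startswith("<"):
            return "html"
        return "other"


if __name__ == "__main__":
    # Sample bodies are made up for illustration only.
    print(describe_response_body('{"ok": true}'))
    print(describe_response_body("<html><body>Error</body></html>"))
```

Inspecting the raw response in the browser's network tab is usually the quickest way to see which of these cases applies.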

Thodoris_Alexopoulos · June 11, 2020, 6:15am

Thanks for your response, James.

I am using version 7.6, and I am trying to parse the following CSV file from https://covid19.who.int/ :

Download Map Data (click the button to download). Sorry, but uploading the CSV file here is not permitted.

Thanks

darnautov · June 18, 2020, 9:39am

Hi @Thodoris_Alexopoulos,

Let me follow up on this one! I downloaded the CSV file you mentioned:
WHO-COVID-19-global-data.csv
sha1 621e103a66b15af0f9e5bc93e0f8ddea4c0e3d3f
22374 rows

Tested on stack version 7.6.0 for both Elasticsearch and Kibana, using default settings for both kibana.yml and elasticsearch.yml on a single-node cluster. I managed to upload the file and create an index without overriding any settings.

Could you please confirm whether your file and stack versions match the ones I mentioned? Also, more details on your Elasticsearch and Kibana configuration would be useful for troubleshooting the issue.

Regards,
Dima
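
One way to check whether a local copy matches the file described above is to compute its SHA-1 digest and line count. The following is a minimal sketch; the function name `sha1_and_line_count` is hypothetical, and whether the 22374 figure includes the header line is an assumption you would need to verify.

```python
import hashlib
from typing import Tuple


def sha1_and_line_count(path: str) -> Tuple[str, int]:
    """Return the SHA-1 hex digest of a file and its number of lines."""
    digest = hashlib.sha1()
    lines = 0
    with open(path, "rb") as f:
        # Iterating a binary file yields it line by line, so the digest
        # covers exactly the same bytes as hashing the whole file.
        for line in f:
            digest.update(line)
            lines += 1
    return digest.hexdigest(), lines


if __name__ == "__main__":
    import sys

    # Usage: python check_csv.py WHO-COVID-19-global-data.csv
    if len(sys.argv) > 1:
        sha1, count = sha1_and_line_count(sys.argv[1])
        print(sha1, count)
```

If the digest differs, the dataset on the site has probably been updated since the file was downloaded, so the row counts will not be directly comparable.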

James_Gowdy · June 23, 2020, 9:17am

Hi @Thodoris_Alexopoulos,

I've experimented with the WHO COVID dataset on a cloud deployment running 7.6. I could not get a timeout when analysing the data, but I did occasionally see one when trying to import it.
So I suspect it is a problem with the amount of data being sent over the network.
Even though the file is not large, it still needs to be chopped up into chunks that are small enough to be sent to the Kibana server.

In 7.8 we have made improvements in the parsing and uploading of data.
https://github.com/elastic/kibana/pull/62970

I have tested the same CSV file with a 7.8 deployment and, across multiple attempts, could not reproduce a timeout failure.

If possible, I would suggest upgrading to 7.8 as it should hopefully fix this issue for you.

Regards,
James
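
For anyone who cannot upgrade yet, the chunking idea mentioned above can be sketched in Python. This is not how Kibana splits uploads internally; it is only an illustration of cutting a CSV into smaller pieces that each keep the header row, so each piece could be uploaded on its own. The names `chunk_csv`, `_render`, and `max_rows` are hypothetical.

```python
import csv
import io
from typing import Iterator, List


def _render(header: List[str], rows: List[List[str]]) -> str:
    """Serialize a header plus rows back into CSV text."""
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(header)
    writer.writerows(rows)
    return out.getvalue()


def chunk_csv(text: str, max_rows: int) -> Iterator[str]:
    """Yield CSV strings with at most max_rows data rows, each with the header."""
    if max_rows < 1:
        raise ValueError("max_rows must be at least 1")
    reader = csv.reader(io.StringIO(text, newline=""))
    header = next(reader, None)
    if header is None:
        return  # empty input: nothing to yield
    batch: List[List[str]] = []
    for row in reader:
        batch.append(row)
        if len(batch) == max_rows:
            yield _render(header, batch)
            batch = []
    if batch:
        # Emit the final, possibly shorter, chunk.
        yield _render(header, batch)
```

A call such as `list(chunk_csv(text, 10000))` would give a list of CSV strings that could be written to separate files. The row limit is an arbitrary example value, not a documented Kibana threshold.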

