Hi all! I was wondering for some time now why doesn't Elasticsearch allow for native CSV import? I don't mean converting the CSV to JSON and then using the Bulk API, but uploading a CSV to the server and then the server does the importing. Or maybe it does but I can't find it.
I will list down the number of benefits of this if this would be the case:
- Direct import from different DB servers that can generate data dumps (psql, mysql) easily and natively so less conversion, and more interoperability with legacy systems.
- Less time spend converting the csv to JSON on the client side to valid JSON saving CPU cycles and trees.
- Less data to transfer from the client to the server for mostly tabular data really, saving data and bandwidth, saving sanity and trees. Json is really quite overkill if we're mostly sending tabular data with tis repeating of props.
The downside will be:
- nested documents are a bit jerky in csv, but those people can still use the plain Bulk API.
For me personally it seems like a good addition to Elasticsearch that covers 80% of the use cases.
Would love to know what the reason is!