Transforming into JSON-Documents

Werner · February 1, 2021, 6:33pm

Hello!

I am an absolute beginner in Elasticsearch, and after some hours of reading and watching tutorial videos I got the impression that it takes a lot of knowledge and preparation before I can run a fast search.

In a tutorial book for Elasticsearch I read:
"Elasticsearch is a search-server, which can store JSON-Documents and (then) search in them."

Does this mean that I first have to transform my huge amount of data into JSON documents before I can search it?
I guess the ELK Stack or Elasticsearch does the transformation into JSON documents for me.
Is that correct?
I also guess that the ELK Stack or Elasticsearch builds the indices that I use for fast searching.
Is that correct?

Thank you for helping me!

warkolm · February 1, 2021, 8:21pm

Elasticsearch will turn your data into JSON when it receives it. The result might not be in the format you want, so it can make sense to do the conversion yourself before sending the data to Elasticsearch.
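
As an illustration of doing the conversion yourself, here is a minimal sketch that turns CSV rows into JSON documents and lays them out in the newline-delimited shape used by the `_bulk` endpoint. The sample data, the index name `books`, and the helper names are assumptions made up for this example; nothing is sent to a server.

```python
import csv
import io
import json


def csv_to_documents(csv_text):
    """Turn each CSV row into a dict keyed by the header row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [dict(row) for row in reader]


def to_bulk_body(documents, index_name):
    """Build a newline-delimited body: one action line, then one source line per document."""
    lines = []
    for doc in documents:
        lines.append(json.dumps({"index": {"_index": index_name}}))
        lines.append(json.dumps(doc))
    # The bulk format requires a trailing newline after the last line.
    return "\n".join(lines) + "\n"


# Hypothetical sample data; "books" is an assumed index name.
sample = "title,year\nFaust,1808\nSiddhartha,1922\n"
docs = csv_to_documents(sample)
body = to_bulk_body(docs, "books")
print(body)
```

Note that the CSV reader returns every value as a string, so `year` arrives as `"1808"` rather than a number. Deciding on such types before indexing is one reason to shape the JSON yourself.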

Elasticsearch handles all the creation and searching of the indices.
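
To give a rough idea of why an index makes searching fast, here is a toy inverted index: a map from each word to the set of documents that contain it. This is only a teaching sketch under simplified assumptions (lowercasing, splitting on word characters, AND semantics) and not how Elasticsearch implements its indices internally. The document texts and function names are invented for the example.

```python
import re
from collections import defaultdict


def build_inverted_index(documents):
    """Map each lowercased term to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for term in re.findall(r"\w+", text.lower()):
            index[term].add(doc_id)
    return index


def search(index, query):
    """Return ids of documents that contain every term of the query."""
    terms = re.findall(r"\w+", query.lower())
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


# Hypothetical documents, keyed by an id.
documents = {
    1: "Elasticsearch stores JSON documents",
    2: "Logstash sends JSON to Elasticsearch",
    3: "Filebeat ships log files",
}
index = build_inverted_index(documents)
print(search(index, "json elasticsearch"))
```

A lookup only touches the entries for the query terms instead of scanning every document, which is the basic reason an index speeds up search.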

Werner · February 1, 2021, 8:51pm

Hello,
and thank you very much for your answer!

You say that the JSON data Elasticsearch creates is often not in an optimal format.
I guess certain tools are usually used to transform the original data into well-formatted JSON.
What kind of tools can I use to transform the original data into well-formatted JSON?

Thank you for helping me.

warkolm · February 1, 2021, 9:39pm

That depends on the type of the source data.

Werner · February 1, 2021, 9:53pm

Hello,
and thank you very much for your answer!
I assume you mean that there is one specific tool for each type?
For example, is there one specific tool for CSV-formatted data?
Or one specific tool for plain ASCII files?

Is there a transformation tool that recognizes the type of each part of the source data?

Thank you.

warkolm · February 1, 2021, 9:57pm

There are things like Filebeat for event-driven data.
Or Logstash can do both event and document style.

Werner · February 1, 2021, 10:08pm

Hello, and thank you.
I don't understand that. Do you mean that both Filebeat and Logstash transform data into well-formed JSON, and that both can recognize the type of the source data by using certain techniques?
Can you explain "event driven" and "document style" in this context? Thanks.

warkolm · February 3, 2021, 3:49am

What sort of data are you looking to ingest here?

Werner · February 3, 2021, 6:53pm

Hello, and thank you.
What I understand is that Filebeat and Logstash transform all or almost all arbitrary data into well-formed JSON data.
I would like to know which tools to use for plain ASCII text files, for Office files like .xls or .doc, for binary files like .png, for .pdf files, for CSV files, for HTML files, for XML files, and for log files.
In principle, all human-readable files, which also includes programming source code (.cpp files, .java files). (I know binary files like .png are not human-readable.)
I would be thankful for a short overview.

Greetings from the Rhine in Germany.

warkolm · February 8, 2021, 4:48am

Filebeat is mostly for event-based data, i.e. stuff with timestamps. Logstash has traditionally been used for that, but can be used for other formats too. They will both transform the incoming data into JSON.
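
To make "event-based data" concrete, here is a minimal sketch of turning one timestamped log line into a JSON event. It is a hand-written stand-in for the kind of work such shippers do, not their actual code. The log line layout, the regular expression, and the field names `level` and `message` are assumptions for this example.

```python
import json
import re
from datetime import datetime

# Assumed line layout: "YYYY-MM-DD HH:MM:SS LEVEL free text"
LINE_PATTERN = re.compile(
    r"^(?P<timestamp>\S+ \S+) (?P<level>[A-Z]+) (?P<message>.*)$"
)


def parse_log_line(line):
    """Return a dict for one log line; fall back to the raw message if it does not parse."""
    text = line.strip()
    match = LINE_PATTERN.match(text)
    if match is None:
        return {"message": text}
    fields = match.groupdict()
    try:
        parsed = datetime.strptime(fields["timestamp"], "%Y-%m-%d %H:%M:%S")
    except ValueError:
        # The first two tokens were not a timestamp in the assumed format.
        return {"message": text}
    return {
        "@timestamp": parsed.isoformat(),
        "level": fields["level"],
        "message": fields["message"],
    }


event = parse_log_line("2021-02-03 18:53:00 ERROR disk almost full")
print(json.dumps(event))
```

Each line becomes one small document with its own timestamp, which is what distinguishes event data from "document style" data such as a whole CSV record set or a text file.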

Regarding the other formats, there are tools that can extract data from binary files. There's nothing native to the Elastic Stack that can do this.

system · March 8, 2021, 4:49am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.
