Charts to represent information extraxted from pdf and images with fscrawler, elasticsearch and kibana

kumar4 · May 3, 2019, 10:56am

I implemented fscrawler, Tesseract with kibana, and elastic search for pdf, and it is possible to know if it is possible to some statistics in graphical charts (example: to know the occurrence of word in each pdf or image) is there any solution or tool that can perform that

dadoonet · May 3, 2019, 11:34am

You can enable fielddata on the content field and use Kibana to display a tag cloud on that field.

Note that it can use a lot of HEAP memory.

But that won't be per document.

kumar4 · May 6, 2019, 1:04am

i didnt understanf can yu explain me how, to give more details i created deirectory in tmp ccalled /tmp/es and puted my files and creating index called testindex but i use kibana dev tool console to dilay content of my pdf and images, how can i visualiz that in kibana without the query console; give the stemps to read data in kibana plz

dadoonet · May 6, 2019, 10:03am

I'm afraid I can't do that.
I'd first learn what elasticsearch and kibana are by following some guides/tutorials:

About fielddata you can read this: https://www.elastic.co/guide/en/elasticsearch/reference/current/fielddata.html

HTH

system · June 3, 2019, 10:03am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
Counting the frequence and visualtsing information about index with fscrawler in chart using kibana Kibana	2	320	May 7, 2019
Visualizing the count of words in each document(pdf, word) in kibana using FSCRAWLER Kibana	4	1063	February 21, 2018
How can we visulize fscrawler index in kibana without using dev tool console Elasticsearch	4	369	June 3, 2019
Extracting texts from Flatten/ Scanned PDF Documents in Kibana Kibana	2	239	September 21, 2023
Query to get content that match some value in pdf and text extraxcted from iamge Elasticsearch	2	354	May 2, 2019

Charts to represent information extraxted from pdf and images with fscrawler, elasticsearch and kibana

Related topics