Index binary files


(benjelloun) #1

hello,

I installed ElasticSearch, its work good i can index and search xml and
json content using Dev HTTP Client.
I need your help to index binary files in elasticsearch then search for
them by content.
I added mapper-attachements to elastic search but what i dont know is how
to specify the folder of pdf or docx files to index it. something like
base64 or i dont know.
Thanks for helping me.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/787f6815-408a-4ef7-bfd3-a5ee6cc02798%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(Radu Gheorghe) #2

Hello,

Normally, you would send indexing requests to the REST API with the stuff
you want Elasticsearch to index:
http://www.elasticsearch.org/guide/en/elasticsearch/reference/current/docs-index_.html

If you want Elasticsearch to automatically fetch files from the file system
for you, have a look at David's FileSystem River:

Best regards,
Radu

Performance Monitoring * Log Analytics * Search Analytics
Solr & Elasticsearch Support * http://sematext.com/

On Tue, Apr 29, 2014 at 6:40 PM, anass benjelloun anass.bnj@gmail.comwrote:

hello,

I installed ElasticSearch, its work good i can index and search xml and
json content using Dev HTTP Client.
I need your help to index binary files in elasticsearch then search for
them by content.
I added mapper-attachements to elastic search but what i dont know is how
to specify the folder of pdf or docx files to index it. something like
base64 or i dont know.
Thanks for helping me.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/787f6815-408a-4ef7-bfd3-a5ee6cc02798%40googlegroups.comhttps://groups.google.com/d/msgid/elasticsearch/787f6815-408a-4ef7-bfd3-a5ee6cc02798%40googlegroups.com?utm_medium=email&utm_source=footer
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAHXA0_2UQpB63eye_Yii0KiGYXiMj8Q6v3swRrxxYNk5jiMxpQ%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


(benjelloun) #3

Hello again,

thanks for your answer.
But my probleme is how to specify the folder which have many documents to
index all of these document.
and i tested already indexing pdf file with giving base64 content and that
didn't work.

Best regards,
Anass BENJELLOUN

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/cf373e11-f548-4b62-9692-e6fe79888cc2%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(David Pilato) #4

Hard to tell without details of what you did so far and what did not work.

May be you could start describing all the steps from the beginning (provide versions for all component you are using as well)?

--
David Pilato | Technical Advocate | Elasticsearch.com
@dadoonet | @elasticsearchfr

Le 6 mai 2014 à 17:30:28, anass benjelloun (anass.bnj@gmail.com) a écrit:

Hello again,

thanks for your answer.
But my probleme is how to specify the folder which have many documents to index all of these document.
and i tested already indexing pdf file with giving base64 content and that didn't work.

Best regards,
Anass BENJELLOUN

You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/cf373e11-f548-4b62-9692-e6fe79888cc2%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/etPan.53690096.b03e0c6.5c36%40MacBook-Air-de-David.local.
For more options, visit https://groups.google.com/d/optout.


(system) #5