I'm trying to index a big document with ES and the Mapper Attachment plugin
(https://github.com/elastic/elasticsearch-mapper-attachments). The document has
719 pages, but after indexing I can search for phrases only up to page 33. When
I index a document, I base64-encode the file contents, and the file gets
added to the index successfully. Is there a limit on the size of the
file?
On 26 March 2015 at 10:51, Jakko Sikkar <jakko....@gmail.com> wrote:
Hi,
I'm trying to index a big document with ES and the Mapper Attachment plugin
(https://github.com/elastic/elasticsearch-mapper-attachments). The document
has 719 pages, but after indexing I can search for phrases only up to page 33.
When I index a document, I base64-encode the file contents, and the file gets
added to the index successfully. Is there a limit on the size of the
file?
Can you please share your ES configuration? I want to index many large PDFs, but I'm not sure how to index them together. Do I have to write a separate curl command for each one? I also need help with this syntax:
PUT /test-mapping/person/1
{
  "my_attachment" : {
    "_name" : "/home/ubuntu/test.pdf",
    "_language" : "en",
    "_content" : "... base64 encoded attachment ..."
  }
}
Do I have to supply the content even if I know I want to index the complete file? How do I specify the location to read from?
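For many files, the request body can be generated by a script instead of being written by hand for each curl call. The following is only a minimal sketch, assuming the `my_attachment` field name and the `_name`, `_language` and `_content` keys from the request in this thread; the helper names `build_attachment_body` and `build_bodies` are hypothetical. The optional `_indexed_chars` key is taken from the plugin's README, which describes a default cap on extracted characters that can be lifted with -1; whether that explains the truncation reported in the first post is an assumption to verify against the plugin version in use.

```python
import base64
import json
from pathlib import Path


def build_attachment_body(pdf_path, language="en", indexed_chars=None):
    """Return the JSON body for one document, with the file base64-encoded.

    The field names mirror the request shown in this thread; they are not
    checked against any particular plugin version.
    """
    path = Path(pdf_path)
    # The plugin expects the raw file bytes as a base64 string in _content.
    encoded = base64.b64encode(path.read_bytes()).decode("ascii")
    attachment = {
        "_name": str(path),
        "_language": language,
        "_content": encoded,
    }
    # Assumption: _indexed_chars is honoured per document, as the plugin
    # README describes; it is left out of the body unless requested.
    if indexed_chars is not None:
        attachment["_indexed_chars"] = indexed_chars
    return json.dumps({"my_attachment": attachment})


def build_bodies(directory):
    """Yield (document id, JSON body) for every PDF in a directory."""
    pdf_paths = sorted(Path(directory).glob("*.pdf"))
    for doc_id, path in enumerate(pdf_paths, start=1):
        yield doc_id, build_attachment_body(path)
```

Each yielded body could then be sent as the payload of a PUT to /test-mapping/person/<id> with whichever HTTP client is preferred. The file is read by the script, not by Elasticsearch, so the content still has to be included in the request.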
Hello Sir,
I am new to Elasticsearch, but I like how powerful a tool it is.
Can you please help me and show me the documentation you have been using to index even one PDF file?
Best regards!