at the moment I'm testing ES for the use as an indexing and storaging
solution. I use the following general settings with analysis-icu and
mapper-attachments plugins enabled:
Unfortunately, the content field does not contain any keyword from the
pdf (just the long string) and every search returns nothing. I think,
I misunderstood something - I'm quite new to JSON/Java and similar
technique.
can you double check that the attachment plugin is installed correctly?
make sure both tika and mapper attachment jar files are in <ES_HOME>/lib
folder
For example if you user 0.17.0-SNAPSHOT you should see the following two
files in lib folder:
elasticsearch-mapper-attachments-0.17.0-SNAPSHOT.jar
tika-app-0.9.jar
is attachment plugin recognized when ES startup?
You can see this in log file. Navigate to <ES_HOME>/logs and check ES
startup log records sequence. You should see something like
at the moment I'm testing ES for the use as an indexing and storaging
solution. I use the following general settings with analysis-icu and
mapper-attachments plugins enabled:
Unfortunately, the content field does not contain any keyword from the
pdf (just the long string) and every search returns nothing. I think,
I misunderstood something - I'm quite new to JSON/Java and similar
technique.
can you double check that the attachment plugin is installed correctly?
make sure both tika and mapper attachment jar files are in <ES_HOME>/lib
folder
For example if you user 0.17.0-SNAPSHOT you should see the following two
files in lib folder:
elasticsearch-mapper-attachments-0.17.0-SNAPSHOT.jar
tika-app-0.9.jar
is attachment plugin recognized when ES startup?
You can see this in log file. Navigate to <ES_HOME>/logs and check ES
startup log records sequence. You should see something like
at the moment I'm testing ES for the use as an indexing and storaging
solution. I use the following general settings with analysis-icu and
mapper-attachments plugins enabled:
Unfortunately, the content field does not contain any keyword from the
pdf (just the long string) and every search returns nothing. I think,
I misunderstood something - I'm quite new to JSON/Java and similar
technique.
I use the precompiled ES 0.16.2 and the two files weren't there, but I
found them to be in ESHOME/plugins/mapper-attachments/ and just copied
them to lib/.
Perhaps that helps somehow?!
can you double check that the attachment plugin is installed correctly?
make sure both tika and mapper attachment jar files are in <ES_HOME>/lib
folder
For example if you user 0.17.0-SNAPSHOT you should see the following two
files in lib folder:
elasticsearch-mapper-attachments-0.17.0-SNAPSHOT.jar
tika-app-0.9.jar
is attachment plugin recognized when ES startup?
You can see this in log file. Navigate to <ES_HOME>/logs and check ES
startup log records sequence. You should see something like
at the moment I'm testing ES for the use as an indexing and storaging
solution. I use the following general settings with analysis-icu and
mapper-attachments plugins enabled:
Unfortunately, the content field does not contain any keyword from the
pdf (just the long string) and every search returns nothing. I think,
I misunderstood something - I'm quite new to JSON/Java and similar
technique.
You can either install plugins manually by copying these files to the lib
folder or you can use bin/plugin install command as explained here:
Once you have both plugins correctly installed and ES is loading them during
startup then you should be ready to use them. Did you try it again? What was
the result now? Any exceptions in log files when indexing attachments or
when searching them?
I use the precompiled ES 0.16.2 and the two files weren't there, but I
found them to be in ESHOME/plugins/mapper-attachments/ and just copied
them to lib/.
Perhaps that helps somehow?!
can you double check that the attachment plugin is installed correctly?
make sure both tika and mapper attachment jar files are in
<ES_HOME>/lib
folder
For example if you user 0.17.0-SNAPSHOT you should see the following
two
files in lib folder:
elasticsearch-mapper-attachments-0.17.0-SNAPSHOT.jar
tika-app-0.9.jar
is attachment plugin recognized when ES startup?
You can see this in log file. Navigate to <ES_HOME>/logs and check ES
startup log records sequence. You should see something like
at the moment I'm testing ES for the use as an indexing and storaging
solution. I use the following general settings with analysis-icu and
mapper-attachments plugins enabled:
Unfortunately, the content field does not contain any keyword from the
pdf (just the long string) and every search returns nothing. I think,
I misunderstood something - I'm quite new to JSON/Java and similar
technique.
1.) what I meant to say was that I had both plugins already installed
with the command you cited (bin/plugin install ....) and they seem to
be loaded as I've shown above. Nevertheless, the associated jar-files
are not in the lib/-directory, only in plugins/mapper-attachments.
Do you think I have to manually move or copy them?
2.) And you are right, in the logfiles I get an error if I'm trying to
index a pdf-file:
[2011-07-07 15:53:30,296][DEBUG][action.index ] [test]
[testindex][0], node[YEPolMwkRhe_eJdNAyuz4Q], [P], s[STARTED]: Failed
to execute [index {[testindex][testmap][1], source}]
org.elasticsearch.ElasticSearchParseException: Failed to derive
xcontent from
at
org.elasticsearch.common.xcontent.XContentFactory.xContent(XContentFactory.java:
181)
at
org.elasticsearch.common.xcontent.XContentFactory.xContent(XContentFactory.java:
172)
at
org.elasticsearch.index.mapper.xcontent.XContentDocumentMapper.parse(XContentDocumentMapper.java:
401)
at
org.elasticsearch.index.mapper.xcontent.XContentDocumentMapper.parse(XContentDocumentMapper.java:
380)
at
org.elasticsearch.index.shard.service.InternalIndexShard.prepareIndex(InternalIndexShard.java:
278)
[......]
So this looks as if the plugin for treating the content isn't working.
Once you have both plugins correctly installed and ES is loading them during
startup then you should be ready to use them. Did you try it again? What was
the result now? Any exceptions in log files when indexing attachments or
when searching them?
I use the precompiled ES 0.16.2 and the two files weren't there, but I
found them to be in ESHOME/plugins/mapper-attachments/ and just copied
them to lib/.
Perhaps that helps somehow?!
can you double check that the attachment plugin is installed correctly?
make sure both tika and mapper attachment jar files are in
<ES_HOME>/lib
folder
For example if you user 0.17.0-SNAPSHOT you should see the following
two
files in lib folder:
elasticsearch-mapper-attachments-0.17.0-SNAPSHOT.jar
tika-app-0.9.jar
is attachment plugin recognized when ES startup?
You can see this in log file. Navigate to <ES_HOME>/logs and check ES
startup log records sequence. You should see something like
at the moment I'm testing ES for the use as an indexing and storaging
solution. I use the following general settings with analysis-icu and
mapper-attachments plugins enabled:
Unfortunately, the content field does not contain any keyword from the
pdf (just the long string) and every search returns nothing. I think,
I misunderstood something - I'm quite new to JSON/Java and similar
technique.
1.) what I meant to say was that I had both plugins already installed
with the command you cited (bin/plugin install ....) and they seem to
be loaded as I've shown above. Nevertheless, the associated jar-files
are not in the lib/-directory, only in plugins/mapper-attachments.
Do you think I have to manually move or copy them?
I normally do not use plugin script for installation because I directly copy
related jar files into lib folder. However, using plugin script is probably
better way and as long as you can see those plugins are loaded during node
startup then they you did good job (assuming you made sure you installed
plugins on ALL nodes in the cluster).
2.) And you are right, in the logfiles I get an error if I'm trying to
index a pdf-file:
So probably we should try to find a simple working example because from the
quick glance at your mapping file it looks fine to me. Again I assume that
you index your json document into /testindex/testmap/
[2011-07-07 15:53:30,296][DEBUG][action.index ] [test]
[testindex][0], node[YEPolMwkRhe_eJdNAyuz4Q], [P], s[STARTED]: Failed
to execute [index {[testindex][testmap][1], source}]
org.elasticsearch.ElasticSearchParseException: Failed to derive
xcontent from
at
org.elasticsearch.common.xcontent.XContentFactory.xContent(XContentFactory.java:
181)
at
org.elasticsearch.common.xcontent.XContentFactory.xContent(XContentFactory.java:
172)
at
org.elasticsearch.index.mapper.xcontent.XContentDocumentMapper.parse(XContentDocumentMapper.java:
401)
at
org.elasticsearch.index.mapper.xcontent.XContentDocumentMapper.parse(XContentDocumentMapper.java:
380)
at
Once you have both plugins correctly installed and ES is loading them
during
startup then you should be ready to use them. Did you try it again? What
was
the result now? Any exceptions in log files when indexing attachments or
when searching them?
I use the precompiled ES 0.16.2 and the two files weren't there, but I
found them to be in ESHOME/plugins/mapper-attachments/ and just copied
them to lib/.
Perhaps that helps somehow?!
can you double check that the attachment plugin is installed
correctly?
make sure both tika and mapper attachment jar files are in
<ES_HOME>/lib
folder
For example if you user 0.17.0-SNAPSHOT you should see the
following
two
files in lib folder:
elasticsearch-mapper-attachments-0.17.0-SNAPSHOT.jar
tika-app-0.9.jar
is attachment plugin recognized when ES startup?
You can see this in log file. Navigate to <ES_HOME>/logs and check
ES
startup log records sequence. You should see something like
at the moment I'm testing ES for the use as an indexing and
storaging
solution. I use the following general settings with analysis-icu
and
mapper-attachments plugins enabled:
Unfortunately, the content field does not contain any keyword from
the
pdf (just the long string) and every search returns nothing. I
think,
I misunderstood something - I'm quite new to JSON/Java and similar
technique.
Yeah, but at the moment I've really no clue. Isn't there any
possibility to verify whether the plugin is working or not? Or do you
have another idea what to try else?
1.) what I meant to say was that I had both plugins already installed
with the command you cited (bin/plugin install ....) and they seem to
be loaded as I've shown above. Nevertheless, the associated jar-files
are not in the lib/-directory, only in plugins/mapper-attachments.
Do you think I have to manually move or copy them?
I normally do not use plugin script for installation because I directly copy
related jar files into lib folder. However, using plugin script is probably
better way and as long as you can see those plugins are loaded during node
startup then they you did good job (assuming you made sure you installed
plugins on ALL nodes in the cluster).
2.) And you are right, in the logfiles I get an error if I'm trying to
index a pdf-file:
So probably we should try to find a simple working example because from the
quick glance at your mapping file it looks fine to me. Again I assume that
you index your json document into /testindex/testmap/
[2011-07-07 15:53:30,296][DEBUG][action.index ] [test]
[testindex][0], node[YEPolMwkRhe_eJdNAyuz4Q], [P], s[STARTED]: Failed
to execute [index {[testindex][testmap][1], source}]
org.elasticsearch.ElasticSearchParseException: Failed to derive
xcontent from
at
org.elasticsearch.common.xcontent.XContentFactory.xContent(XContentFactory.java:
181)
at
org.elasticsearch.common.xcontent.XContentFactory.xContent(XContentFactory.java:
172)
at
org.elasticsearch.index.mapper.xcontent.XContentDocumentMapper.parse(XContentDocumentMapper.java:
401)
at
org.elasticsearch.index.mapper.xcontent.XContentDocumentMapper.parse(XContentDocumentMapper.java:
380)
at
Once you have both plugins correctly installed and ES is loading them
during
startup then you should be ready to use them. Did you try it again? What
was
the result now? Any exceptions in log files when indexing attachments or
when searching them?
I use the precompiled ES 0.16.2 and the two files weren't there, but I
found them to be in ESHOME/plugins/mapper-attachments/ and just copied
them to lib/.
Perhaps that helps somehow?!
can you double check that the attachment plugin is installed
correctly?
make sure both tika and mapper attachment jar files are in
<ES_HOME>/lib
folder
For example if you user 0.17.0-SNAPSHOT you should see the
following
two
files in lib folder:
elasticsearch-mapper-attachments-0.17.0-SNAPSHOT.jar
tika-app-0.9.jar
is attachment plugin recognized when ES startup?
You can see this in log file. Navigate to <ES_HOME>/logs and check
ES
startup log records sequence. You should see something like
at the moment I'm testing ES for the use as an indexing and
storaging
solution. I use the following general settings with analysis-icu
and
mapper-attachments plugins enabled:
Unfortunately, the content field does not contain any keyword from
the
pdf (just the long string) and every search returns nothing. I
think,
I misunderstood something - I'm quite new to JSON/Java and similar
technique.
Yeah, but at the moment I've really no clue. Isn't there any
possibility to verify whether the plugin is working or not? Or do you
have another idea what to try else?
1.) what I meant to say was that I had both plugins already installed
with the command you cited (bin/plugin install ....) and they seem to
be loaded as I've shown above. Nevertheless, the associated jar-files
are not in the lib/-directory, only in plugins/mapper-attachments.
Do you think I have to manually move or copy them?
I normally do not use plugin script for installation because I directly copy
related jar files into lib folder. However, using plugin script is probably
better way and as long as you can see those plugins are loaded during node
startup then they you did good job (assuming you made sure you installed
plugins on ALL nodes in the cluster).
2.) And you are right, in the logfiles I get an error if I'm trying to
index a pdf-file:
So probably we should try to find a simple working example because from the
quick glance at your mapping file it looks fine to me. Again I assume that
you index your json document into /testindex/testmap/
[2011-07-07 15:53:30,296][DEBUG][action.index ] [test]
[testindex][0], node[YEPolMwkRhe_eJdNAyuz4Q], [P], s[STARTED]: Failed
to execute [index {[testindex][testmap][1], source}]
org.elasticsearch.ElasticSearchParseException: Failed to derive
xcontent from
at
org.elasticsearch.common.xcontent.XContentFactory.xContent(XContentFactory.java:
181)
at
org.elasticsearch.common.xcontent.XContentFactory.xContent(XContentFactory.java:
172)
at
org.elasticsearch.index.mapper.xcontent.XContentDocumentMapper.parse(XContentDocumentMapper.java:
401)
at
org.elasticsearch.index.mapper.xcontent.XContentDocumentMapper.parse(XContentDocumentMapper.java:
380)
at
Once you have both plugins correctly installed and ES is loading them
during
startup then you should be ready to use them. Did you try it again? What
was
the result now? Any exceptions in log files when indexing attachments or
when searching them?
I use the precompiled ES 0.16.2 and the two files weren't there, but I
found them to be in ESHOME/plugins/mapper-attachments/ and just copied
them to lib/.
Perhaps that helps somehow?!
can you double check that the attachment plugin is installed
correctly?
make sure both tika and mapper attachment jar files are in
<ES_HOME>/lib
folder
For example if you user 0.17.0-SNAPSHOT you should see the
following
two
files in lib folder:
elasticsearch-mapper-attachments-0.17.0-SNAPSHOT.jar
tika-app-0.9.jar
is attachment plugin recognized when ES startup?
You can see this in log file. Navigate to <ES_HOME>/logs and check
ES
startup log records sequence. You should see something like
at the moment I'm testing ES for the use as an indexing and
storaging
solution. I use the following general settings with analysis-icu
and
mapper-attachments plugins enabled:
Unfortunately, the content field does not contain any keyword from
the
pdf (just the long string) and every search returns nothing. I
think,
I misunderstood something - I'm quite new to JSON/Java and similar
technique.
Yeah, but at the moment I've really no clue. Isn't there any
possibility to verify whether the plugin is working or not? Or do you
have another idea what to try else?
1.) what I meant to say was that I had both plugins already installed
with the command you cited (bin/plugin install ....) and they seem to
be loaded as I've shown above. Nevertheless, the associated jar-files
are not in the lib/-directory, only in plugins/mapper-attachments.
Do you think I have to manually move or copy them?
I normally do not use plugin script for installation because I directly copy
related jar files into lib folder. However, using plugin script is probably
better way and as long as you can see those plugins are loaded during node
startup then they you did good job (assuming you made sure you installed
plugins on ALL nodes in the cluster).
2.) And you are right, in the logfiles I get an error if I'm trying to
index a pdf-file:
So probably we should try to find a simple working example because from the
quick glance at your mapping file it looks fine to me. Again I assume that
you index your json document into /testindex/testmap/
[2011-07-07 15:53:30,296][DEBUG][action.index ] [test]
[testindex][0], node[YEPolMwkRhe_eJdNAyuz4Q], [P], s[STARTED]: Failed
to execute [index {[testindex][testmap][1], source}]
org.elasticsearch.ElasticSearchParseException: Failed to derive
xcontent from
at
org.elasticsearch.common.xcontent.XContentFactory.xContent(XContentFactory.java:
181)
at
org.elasticsearch.common.xcontent.XContentFactory.xContent(XContentFactory.java:
172)
at
org.elasticsearch.index.mapper.xcontent.XContentDocumentMapper.parse(XContentDocumentMapper.java:
401)
at
org.elasticsearch.index.mapper.xcontent.XContentDocumentMapper.parse(XContentDocumentMapper.java:
380)
at
Once you have both plugins correctly installed and ES is loading them
during
startup then you should be ready to use them. Did you try it again? What
was
the result now? Any exceptions in log files when indexing attachments or
when searching them?
I use the precompiled ES 0.16.2 and the two files weren't there, but I
found them to be in ESHOME/plugins/mapper-attachments/ and just copied
them to lib/.
Perhaps that helps somehow?!
can you double check that the attachment plugin is installed
correctly?
make sure both tika and mapper attachment jar files are in
<ES_HOME>/lib
folder
For example if you user 0.17.0-SNAPSHOT you should see the
following
two
files in lib folder:
elasticsearch-mapper-attachments-0.17.0-SNAPSHOT.jar
tika-app-0.9.jar
is attachment plugin recognized when ES startup?
You can see this in log file. Navigate to <ES_HOME>/logs and check
ES
startup log records sequence. You should see something like
at the moment I'm testing ES for the use as an indexing and
storaging
solution. I use the following general settings with analysis-icu
and
mapper-attachments plugins enabled:
Unfortunately, the content field does not contain any keyword from
the
pdf (just the long string) and every search returns nothing. I
think,
I misunderstood something - I'm quite new to JSON/Java and similar
technique.
Yeah, but at the moment I've really no clue. Isn't there any
possibility to verify whether the plugin is working or not? Or do you
have another idea what to try else?
1.) what I meant to say was that I had both plugins already installed
with the command you cited (bin/plugin install ....) and they seem to
be loaded as I've shown above. Nevertheless, the associated jar-files
are not in the lib/-directory, only in plugins/mapper-attachments.
Do you think I have to manually move or copy them?
I normally do not use plugin script for installation because I
directly copy
related jar files into lib folder. However, using plugin script is
probably
better way and as long as you can see those plugins are loaded during
node
startup then they you did good job (assuming you made sure you
installed
plugins on ALL nodes in the cluster).
2.) And you are right, in the logfiles I get an error if I'm trying
to
index a pdf-file:
So probably we should try to find a simple working example because
from the
quick glance at your mapping file it looks fine to me. Again I assume
that
you index your json document into /testindex/testmap/
[2011-07-07 15:53:30,296][DEBUG][action.index ] [test]
[testindex][0], node[YEPolMwkRhe_eJdNAyuz4Q], [P], s[STARTED]: Failed
to execute [index {[testindex][testmap][1], source}]
org.elasticsearch.ElasticSearchParseException: Failed to derive
xcontent from
at
Once you have both plugins correctly installed and ES is loading
them
during
startup then you should be ready to use them. Did you try it again?
What
was
the result now? Any exceptions in log files when indexing
attachments or
when searching them?
I use the precompiled ES 0.16.2 and the two files weren't there,
but I
found them to be in ESHOME/plugins/mapper-attachments/ and just
copied
them to lib/.
Perhaps that helps somehow?!
can you double check that the attachment plugin is installed
correctly?
make sure both tika and mapper attachment jar files are in
<ES_HOME>/lib
folder
For example if you user 0.17.0-SNAPSHOT you should see the
following
two
files in lib folder:
elasticsearch-mapper-attachments-0.17.0-SNAPSHOT.jar
tika-app-0.9.jar
is attachment plugin recognized when ES startup?
You can see this in log file. Navigate to <ES_HOME>/logs and
check
ES
startup log records sequence. You should see something like
at the moment I'm testing ES for the use as an indexing and
storaging
solution. I use the following general settings with analysis-icu
and
mapper-attachments plugins enabled:
Unfortunately, the content field does not contain any keyword
from
the
pdf (just the long string) and every search returns nothing. I
think,
I misunderstood something - I'm quite new to JSON/Java and
similar
technique.
Yeah, but at the moment I've really no clue. Isn't there any
possibility to verify whether the plugin is working or not? Or do you
have another idea what to try else?
1.) what I meant to say was that I had both plugins already installed
with the command you cited (bin/plugin install ....) and they seem to
be loaded as I've shown above. Nevertheless, the associated jar-files
are not in the lib/-directory, only in plugins/mapper-attachments.
Do you think I have to manually move or copy them?
I normally do not use plugin script for installation because I
directly copy
related jar files into lib folder. However, using plugin script is
probably
better way and as long as you can see those plugins are loaded during
node
startup then they you did good job (assuming you made sure you
installed
plugins on ALL nodes in the cluster).
2.) And you are right, in the logfiles I get an error if I'm trying
to
index a pdf-file:
So probably we should try to find a simple working example because
from the
quick glance at your mapping file it looks fine to me. Again I assume
that
you index your json document into /testindex/testmap/
[2011-07-07 15:53:30,296][DEBUG][action.index ] [test]
[testindex][0], node[YEPolMwkRhe_eJdNAyuz4Q], [P], s[STARTED]: Failed
to execute [index {[testindex][testmap][1], source}]
org.elasticsearch.ElasticSearchParseException: Failed to derive
xcontent from
at
Once you have both plugins correctly installed and ES is loading
them
during
startup then you should be ready to use them. Did you try it again?
What
was
the result now? Any exceptions in log files when indexing
attachments or
when searching them?
I use the precompiled ES 0.16.2 and the two files weren't there,
but I
found them to be in ESHOME/plugins/mapper-attachments/ and just
copied
them to lib/.
Perhaps that helps somehow?!
can you double check that the attachment plugin is installed
correctly?
make sure both tika and mapper attachment jar files are in
<ES_HOME>/lib
folder
For example if you user 0.17.0-SNAPSHOT you should see the
following
two
files in lib folder:
elasticsearch-mapper-attachments-0.17.0-SNAPSHOT.jar
tika-app-0.9.jar
is attachment plugin recognized when ES startup?
You can see this in log file. Navigate to <ES_HOME>/logs and
check
ES
startup log records sequence. You should see something like
at the moment I'm testing ES for the use as an indexing and
storaging
solution. I use the following general settings with analysis-icu
and
mapper-attachments plugins enabled:
Unfortunately, the content field does not contain any keyword
from
the
pdf (just the long string) and every search returns nothing. I
think,
I misunderstood something - I'm quite new to JSON/Java and
similar
technique.
sorry, there were some issues in the script. Fixed.
First issue was on the curl command (line 13) using "_mappings" instead of
"_mapping"
Second issue in the perl command, I was missing -n option. (line 29)
Then I added a refresh after indexing.
Finally, I enhanced mapping and the query to pull only the document title.
Yeah, but at the moment I've really no clue. Isn't there any
possibility to verify whether the plugin is working or not? Or do
you
have another idea what to try else?
1.) what I meant to say was that I had both plugins already
installed
with the command you cited (bin/plugin install ....) and they
seem to
be loaded as I've shown above. Nevertheless, the associated
jar-files
are not in the lib/-directory, only in
plugins/mapper-attachments.
Do you think I have to manually move or copy them?
I normally do not use plugin script for installation because I
directly copy
related jar files into lib folder. However, using plugin script is
probably
better way and as long as you can see those plugins are loaded
during
node
startup then they you did good job (assuming you made sure you
installed
plugins on ALL nodes in the cluster).
2.) And you are right, in the logfiles I get an error if I'm
trying
to
index a pdf-file:
So probably we should try to find a simple working example because
from the
quick glance at your mapping file it looks fine to me. Again I
assume
that
you index your json document into /testindex/testmap/
[2011-07-07 15:53:30,296][DEBUG][action.index ]
[test]
[testindex][0], node[YEPolMwkRhe_eJdNAyuz4Q], [P], s[STARTED]:
Failed
to execute [index {[testindex][testmap][1], source}]
org.elasticsearch.ElasticSearchParseException: Failed to derive
xcontent from
at
Once you have both plugins correctly installed and ES is loading
them
during
startup then you should be ready to use them. Did you try it
again?
What
was
the result now? Any exceptions in log files when indexing
attachments or
when searching them?
I use the precompiled ES 0.16.2 and the two files weren't
there,
but I
found them to be in ESHOME/plugins/mapper-attachments/ and just
copied
them to lib/.
Perhaps that helps somehow?!
can you double check that the attachment plugin is installed
correctly?
make sure both tika and mapper attachment jar files are in
<ES_HOME>/lib
folder
For example if you user 0.17.0-SNAPSHOT you should see the
following
two
files in lib folder:
elasticsearch-mapper-attachments-0.17.0-SNAPSHOT.jar
tika-app-0.9.jar
is attachment plugin recognized when ES startup?
You can see this in log file. Navigate to <ES_HOME>/logs and
check
ES
startup log records sequence. You should see something like
at the moment I'm testing ES for the use as an indexing and
storaging
solution. I use the following general settings with
analysis-icu
and
mapper-attachments plugins enabled:
Unfortunately, the content field does not contain any
keyword
from
the
pdf (just the long string) and every search returns nothing.
I
think,
I misunderstood something - I'm quite new to JSON/Java and
similar
technique.
sorry, there were some issues in the script. Fixed.
First issue was on the curl command (line 13) using "_mappings" instead of
"_mapping"
Second issue in the perl command, I was missing -n option. (line 29)
Then I added a refresh after indexing.
Finally, I enhanced mapping and the query to pull only the document title.
Yeah, but at the moment I've really no clue. Isn't there any
possibility to verify whether the plugin is working or not? Or do
you
have another idea what to try else?
1.) what I meant to say was that I had both plugins already
installed
with the command you cited (bin/plugin install ....) and they
seem to
be loaded as I've shown above. Nevertheless, the associated
jar-files
are not in the lib/-directory, only in
plugins/mapper-attachments.
Do you think I have to manually move or copy them?
I normally do not use plugin script for installation because I
directly copy
related jar files into lib folder. However, using plugin script is
probably
better way and as long as you can see those plugins are loaded
during
node
startup then they you did good job (assuming you made sure you
installed
plugins on ALL nodes in the cluster).
2.) And you are right, in the logfiles I get an error if I'm
trying
to
index a pdf-file:
So probably we should try to find a simple working example because
from the
quick glance at your mapping file it looks fine to me. Again I
assume
that
you index your json document into /testindex/testmap/
[2011-07-07 15:53:30,296][DEBUG][action.index ]
[test]
[testindex][0], node[YEPolMwkRhe_eJdNAyuz4Q], [P], s[STARTED]:
Failed
to execute [index {[testindex][testmap][1], source}]
org.elasticsearch.ElasticSearchParseException: Failed to derive
xcontent from
at
Once you have both plugins correctly installed and ES is loading
them
during
startup then you should be ready to use them. Did you try it
again?
What
was
the result now? Any exceptions in log files when indexing
attachments or
when searching them?
I use the precompiled ES 0.16.2 and the two files weren't
there,
but I
found them to be in ESHOME/plugins/mapper-attachments/ and just
copied
them to lib/.
Perhaps that helps somehow?!
can you double check that the attachment plugin is installed
correctly?
make sure both tika and mapper attachment jar files are in
<ES_HOME>/lib
folder
For example if you user 0.17.0-SNAPSHOT you should see the
following
two
files in lib folder:
elasticsearch-mapper-attachments-0.17.0-SNAPSHOT.jar
tika-app-0.9.jar
is attachment plugin recognized when ES startup?
You can see this in log file. Navigate to <ES_HOME>/logs and
check
ES
startup log records sequence. You should see something like
at the moment I'm testing ES for the use as an indexing and
storaging
solution. I use the following general settings with
analysis-icu
and
mapper-attachments plugins enabled:
This exception comes from Tika. Generally speaking, parsing PDF documents is
a pain for anybody, not only for Tika (or pdfbox which Tika delegates to).
So the chance is that Tika is hitting some issues with this particular
document. The question is if this is serious or not. Note that if you are
seeing this when parsing the document from my example then this PDF document
contains a lot of complex charts and tables with texts. Typically such
documents (highly technical product sheets) are generated automatically from
product data store and some level of formatting issues are probably expected
(but I am just guessing here).
sorry, there were some issues in the script. Fixed.
First issue was on the curl command (line 13) using "_mappings" instead
of
"_mapping"
Second issue in the perl command, I was missing -n option. (line 29)
Then I added a refresh after indexing.
Finally, I enhanced mapping and the query to pull only the document
title.
Yeah, but at the moment I've really no clue. Isn't there any
possibility to verify whether the plugin is working or not? Or
do
you
have another idea what to try else?
1.) what I meant to say was that I had both plugins already
installed
with the command you cited (bin/plugin install ....) and they
seem to
be loaded as I've shown above. Nevertheless, the associated
jar-files
are not in the lib/-directory, only in
plugins/mapper-attachments.
Do you think I have to manually move or copy them?
I normally do not use plugin script for installation because I
directly copy
related jar files into lib folder. However, using plugin
script is
probably
better way and as long as you can see those plugins are loaded
during
node
startup then they you did good job (assuming you made sure you
installed
plugins on ALL nodes in the cluster).
2.) And you are right, in the logfiles I get an error if I'm
trying
to
index a pdf-file:
So probably we should try to find a simple working example
because
from the
quick glance at your mapping file it looks fine to me. Again I
assume
that
you index your json document into /testindex/testmap/
[2011-07-07 15:53:30,296][DEBUG][action.index ]
[test]
[testindex][0], node[YEPolMwkRhe_eJdNAyuz4Q], [P],
s[STARTED]:
Failed
to execute [index {[testindex][testmap][1], source}]
org.elasticsearch.ElasticSearchParseException: Failed to
derive
xcontent from
at
Once you have both plugins correctly installed and ES is
loading
them
during
startup then you should be ready to use them. Did you try it
again?
What
was
the result now? Any exceptions in log files when indexing
attachments or
when searching them?
I use the precompiled ES 0.16.2 and the two files weren't
there,
but I
found them to be in ESHOME/plugins/mapper-attachments/ and
just
copied
them to lib/.
Perhaps that helps somehow?!
can you double check that the attachment plugin is
installed
correctly?
make sure both tika and mapper attachment jar files
are in
<ES_HOME>/lib
folder
For example if you user 0.17.0-SNAPSHOT you should see
the
following
two
files in lib folder:
elasticsearch-mapper-attachments-0.17.0-SNAPSHOT.jar
tika-app-0.9.jar
is attachment plugin recognized when ES startup?
You can see this in log file. Navigate to <ES_HOME>/logs
and
check
ES
startup log records sequence. You should see something
like
at the moment I'm testing ES for the use as an indexing
and
storaging
solution. I use the following general settings with
analysis-icu
and
mapper-attachments plugins enabled:
The warnings are expected as this requires some permissions to run.
But you should better open a new thread instead of answering on a thread which is out-dated.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.