I'm not really sure how I'd research the history of this issue, or if it
is in fact an issue or ignorance on my part, but it's nature is elusive as
far as I can tell and so I'm elaborating here...
Description
I have a query in my percolator index that I expect to match a given
document.
I percolate the document and the query is not returned.
I retrieve the query, via a get request; and then dump the body of the
query into a file.
I then index the contents of that file, without modifying it, into the same
index's '.percolator' type, with a different id.
(In other words, I've re-indexed the afflicted query without modifying it.)
Re-percolating the same document now returns the newly indexed query, while
still excluding the original identical query.
The only thing I can think of is that, perhaps somehow the mapping for
.percolator was different at the time of indexing the original query, and
thus there's something different stored in lucene for that query compared
to what's stored for the new query. But I don't have a good enough
understanding of how mappings and storage works for the percolator, and as
far as I've read this isn't commented on in the documentation. Is the above
scenario possible?
I'm not really sure how I'd research the history of this issue, or if
it is in fact an issue or ignorance on my part, but it's nature
is elusive as far as I can tell and so I'm elaborating here...
Description
I have a query in my percolator index that I expect to match a given
document.
I percolate the document and the query is not returned.
I retrieve the query, via a get request; and then dump the body of the
query into a file.
I then index the contents of that file, without modifying it, into the
same index's '.percolator' type, with a different id.
(In other words, I've re-indexed the afflicted query without modifying it.)
Re-percolating the same document now returns the newly indexed query,
while still excluding the original identical query.
The only thing I can think of is that, perhaps somehow the mapping for
.percolator was different at the time of indexing the original query, and
thus there's something different stored in lucene for that query compared
to what's stored for the new query. But I don't have a good enough
understanding of how mappings and storage works for the percolator, and as
far as I've read this isn't commented on in the documentation. Is the above
scenario possible?
{
"doc": {
"random field": [
"\n\nMay 04--The 49ers employ an All-Pro linebacker whose
college career appeared to foreshadow off-the-field trouble in the NFL. He
was involved in an on-campus fight, suspended by his head coach and
admitted to gasolining"
],
"product": "KRT"
}
}
The above is similar to the document I was using, modifying random_field's
value by hand, and with some extra key:value pairs removed. (Note the
implied stemming here -- the identical query referred to above matched, as
stemming was applied correctly.)
On Friday, May 9, 2014 6:16:14 AM UTC-4, Martijn v Groningen wrote:
Hi,
Can you also share the document being percolated? I would expect the query
to match if gasoline occurs in it and 'product' is equal to 'some_product'.
On 8 May 2014 03:45, Adam Georgiou <apg...@gmail.com <javascript:>> wrote:
Disclaimer/Naivety Hedge
I'm not really sure how I'd research the history of this issue, or if
it is in fact an issue or ignorance on my part, but it's nature
is elusive as far as I can tell and so I'm elaborating here...
Description
I have a query in my percolator index that I expect to match a given
document.
I percolate the document and the query is not returned.
I retrieve the query, via a get request; and then dump the body of the
query into a file.
I then index the contents of that file, without modifying it, into the
same index's '.percolator' type, with a different id.
(In other words, I've re-indexed the afflicted query without modifying
it.)
Re-percolating the same document now returns the newly indexed query,
while still excluding the original identical query.
The only thing I can think of is that, perhaps somehow the mapping for
.percolator was different at the time of indexing the original query, and
thus there's something different stored in lucene for that query compared
to what's stored for the new query. But I don't have a good enough
understanding of how mappings and storage works for the percolator, and as
far as I've read this isn't commented on in the documentation. Is the above
scenario possible?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.