I'm working on providing advanced searching for annotated Medical Documents
(using UIMA). In the context of an annotated document, I identify relevant
medical terms, as well as the negation of certain terms. Following what
I've read and seen in Lucene examples, I've been able to provide a search
that takes into account the metadata contained in the payload. The search,
however, is very primitive and uses the PayloadSpanUtil to return the
payloads. Then I iterate over the payloads and exclude those that don't
match the criteria. (I'm looking for a better way to exclude the terms
based on the payload during the query).
I've currently implemented a Custom Analyzer in Lucene and also registered
it in ElasticSearch (kudos -- very easy to integrate). However, with
regard to the searching, I'm not sure how payloads are exposed in ES.
I noticed this post (Token Attributes)http://elasticsearch-users.115913.n3.nabble.com/Token-attributes-td2648940.html in
March. Has anything changed regarding this availability?