we have two indices, a global article index and a specific shop article
index.
The shop article index may contain articles that are already in the global
article index.
These articles may have more information than there global counterparts.
We formulate queries across both indices at once using the multiple indices
api.
How can we discard duplicates?
Either you search only a list of specific shops, or the global index. If
you search both, your design implies you want duplicates (and to remove
them afterwards is always clumsy)
Jörg
Am 18.02.13 11:38, schrieb Lion Vollnhals:
Hello,
we have two indices, a global article index and a specific shop
article index.
The shop article index may contain articles that are already in the
global article index.
These articles may have more information than there global counterparts.
We formulate queries across both indices at once using the multiple
indices api.
How can we discard duplicates?
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.