I've indexed a few PDF files. I followed the {{ test_file name }} index naming convention, which let me create a few indices based on the titles of the PDF files.
But I'm confused about whether to create multiple indices or just one index.
Since I've only indexed a small amount of data for the POC so far, search speed is good (though searches take less time when I use a single index).
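A minimal sketch of what a per-title naming step like this might look like, assuming the index name is derived from the PDF file name. The helper `index_name_for` and the `pdf` prefix are hypothetical and only for illustration; the one firm constraint it reflects is that Elasticsearch index names must be lowercase and must not contain characters such as spaces, commas, `/`, `\`, `*`, `?`, `"`, `<`, `>`, `|`, `#` or `:`.

```python
import re

# Characters Elasticsearch rejects in index names, plus any whitespace.
_INVALID = re.compile(r'[\\/*?"<>|,#:\s]+')


def index_name_for(pdf_filename: str, prefix: str = "pdf") -> str:
    """Build a lowercase index name from a PDF file name (hypothetical helper)."""
    # Drop a trailing ".pdf" extension, case-insensitively.
    title = re.sub(r"\.pdf$", "", pdf_filename, flags=re.IGNORECASE)
    # Lowercase, then collapse each run of invalid characters into one underscore.
    slug = _INVALID.sub("_", title.lower())
    # Trim leading/trailing separators; names may not start with "-", "_" or "+".
    slug = slug.strip("_-+.")
    # Fall back to the bare prefix if nothing usable is left.
    return f"{prefix}_{slug}" if slug else prefix


if __name__ == "__main__":
    for name in ["Annual Report 2020.pdf", "User/Guide?.PDF"]:
        print(name, "->", index_name_for(name))
```

With a single index, the same title would instead be stored as an ordinary field on each document, and the naming step disappears.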
Search relevancy: the results vary. When I run the same query against a single index and against multiple indices, different results are displayed. Sometimes the single-index results are accurate and sometimes they are not.
How do I decide whether to use a single index or multiple indices? Please suggest.
Note: This is not log data, and the data is stored permanently. Also, new data will be added weekly/monthly.
To make it clearer: since there will be much more data when we move to production (say, more than 100 GB), is it good to go with one index, or is it better to split it into multiple indices?
In addition, are there any differences in search relevancy between using a single index and multiple indices?