I need to search for all sorts of word combinations and substrings of words through my search engine.
To achieve this I am using ngram filters, and they work fine, but my main concern is storage space: for 23,000 documents the index takes up to 1.7 GB on my local system. I want to reduce it because the live server has many more records. How can I reduce this storage footprint?
Unfortunately, that is the main drawback of ngrams. What are your min and
max gram settings? Can you use edge ngrams instead? If your max ngram
setting is high, consider reducing it.
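
To see why the min and max gram settings matter so much for index size, here is a rough sketch in plain Python. It only approximates what an ngram filter and an edge ngram filter emit for a single word; the function names and the sample word are made up for illustration and this is not Elasticsearch code.

```python
def ngrams(word, min_gram, max_gram):
    """Every substring of the word with length min_gram..max_gram."""
    return [word[i:i + n]
            for n in range(min_gram, max_gram + 1)
            for i in range(len(word) - n + 1)]


def edge_ngrams(word, min_gram, max_gram):
    """Only the prefixes of the word with length min_gram..max_gram."""
    longest = min(max_gram, len(word))
    return [word[:n] for n in range(min_gram, longest + 1)]


word = "elasticsearch"  # hypothetical 13-character sample term

# Tokens produced for this one word under different settings.
print(len(ngrams(word, 1, 20)))      # wide range: 91 tokens
print(len(ngrams(word, 2, 5)))       # narrower range: 42 tokens
print(len(edge_ngrams(word, 2, 5)))  # prefixes only: 4 tokens
```

Every one of those tokens ends up as a term in the index, so a wide min/max range multiplies the storage needed per word. The trade-off with edge ngrams is that they only match from the start of a word, so they do not cover arbitrary substring search in the middle of a word.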
One more thing: if I have a title column that contains a phrase or a sentence, is ngram the best option for searching for that phrase or sentence, or is there some other way to search sentences and phrases properly?