Hey,
I downloaded the master branch from git and installed the 2.0.0 beta to compare 'best_compression' against our current platform's compression on our data set.
I indexed 1840000 documents, then added index.codec: "best_compression" to the config file, restarted, and ran the optimize API.
Unfortunately I did not see much of a difference; the index was actually slightly bigger with 'best_compression'.
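A minimal sketch of the steps above, for anyone who wants to reproduce them. The host `http://localhost:9200` and the index name `test` are placeholders, not taken from the actual setup, and the `max_num_segments=1` parameter is an assumption about how the optimize call was made. The sketch only builds the requests; sending them is left to the caller.

```python
import urllib.request

# Line added to config/elasticsearch.yml before restarting the node.
CONFIG_LINE = 'index.codec: "best_compression"'

# Columns shown in the size listings in this post.
CAT_COLUMNS = "docs.count,docs.deleted,store.size,pri.store.size"


def optimize_request(base_url, index):
    """Build a POST to the optimize API that merges the index to one segment."""
    url = "{}/{}/_optimize?max_num_segments=1".format(base_url, index)
    return urllib.request.Request(url, method="POST")


def cat_indices_request(base_url, index):
    """Build a GET to the cat indices API restricted to the size columns."""
    url = "{}/_cat/indices/{}?v&h={}".format(base_url, index, CAT_COLUMNS)
    return urllib.request.Request(url, method="GET")


if __name__ == "__main__":
    # Placeholder host and index name; adjust to the real cluster.
    for req in (optimize_request("http://localhost:9200", "test"),
                cat_indices_request("http://localhost:9200", "test")):
        with urllib.request.urlopen(req) as resp:
            print(resp.read().decode("utf-8"))
```

Segments written before the codec change keep their old format until they are rewritten, which is why the optimize call comes after the restart.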
My 1.6.0 node runs on CentOS, and this is its index size:
docs.count docs.deleted store.size pri.store.size
   1840000            0      152mb          152mb
The 2.0.0 beta runs on OS X, and this is its index size:
docs.count docs.deleted store.size pri.store.size
   1840000            0      156mb          156mb
My data set consists of 80000 documents containing a mix of random ints and strings, duplicated 23 times (80000 x 23 = 1840000).
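To make the shape of the data concrete, here is a rough sketch of how such a data set could be generated. The field names `num` and `text`, the string length, and the value ranges are hypothetical; the real documents are not shown in this post.

```python
import random
import string

BASE_DOCS = 80000               # distinct documents
COPIES = 23                     # times each document is duplicated
TOTAL_DOCS = BASE_DOCS * COPIES  # 1840000 documents indexed in total


def make_base_docs(n, seed=0):
    """Create n documents with one random int and one random string each."""
    rng = random.Random(seed)
    docs = []
    for _ in range(n):
        docs.append({
            "num": rng.randint(0, 2**31 - 1),                            # hypothetical field
            "text": "".join(rng.choices(string.ascii_letters, k=16)),    # hypothetical field
        })
    return docs


def duplicated(docs, copies):
    """Yield the whole document list `copies` times, in order."""
    for _ in range(copies):
        for doc in docs:
            yield doc
```

For example, `duplicated(make_base_docs(BASE_DOCS), COPIES)` yields the 1840000 documents one at a time, so they can be fed to a bulk indexer without holding every copy in memory.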
What am I doing wrong?
Or what should I expect?