I finally finished a grueling upgrade of my local code from Lucene 3.6 to
4.3. I don't use elasticsearch for everything and still have a fair amount
of Lucene code. You name it, I have a custom class for it.
With the new Lucene jars in place, I was finally able to upgrade
elasticsearch from 0.90.1 from 0.20.0 (Lucene class conflicts being the
obstacle). So far my Lucene code has produced much smaller indices, which
I'm still testing. My elasticsearch specific code has not changed (besides
fixing API changes), and neither has my configuration. I do some
pre-tokenization on the client side for various reasons, but elasticsearch
does the bulk of the analysis. The resulting test index is one third of the
original size:
size: 15.8gb (15.8gb)
docs: 8711039 (8711039)
size: 5.2gb (5.2gb)
docs: 8757039 (8757039)
I did disable timestamps (elasticsearch bug which I will fix), but
everything else is the same. A two-thirds reduction scares me a bit. Has
anyone seen such a dramatic reduction in index size?
Cheers,
Ivan
--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.