What is bad of using pulsing postings format?


(陳智清) #1

From this article
(http://blog.mikemccandless.com/2010/06/lucenes-pulsingcodec-on-primary-key.html)
I know pulsing codec saves one disk seek hence introduces performance gain.
I would like to know what I pay for using pulsing codec? What will happen
if I give it a high cut-off frequency so that all postings are stored in
term dictionary?

In other words, instead of the goodness, I would like to know what is the
drawback of using pulsing postings format?

Thank you.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/4d4c5ad8-baab-4c81-85a5-dc75095a7f5a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.


(Robert Muir-2) #2

There is not really an advantage to it.

The optimization has been incorporated into the default index format
of lucene: https://issues.apache.org/jira/browse/LUCENE-4498

On Sun, Jul 13, 2014 at 10:20 PM, 陳智清 walker0902@gmail.com wrote:

From this article
(http://blog.mikemccandless.com/2010/06/lucenes-pulsingcodec-on-primary-key.html)
I know pulsing codec saves one disk seek hence introduces performance gain.
I would like to know what I pay for using pulsing codec? What will happen if
I give it a high cut-off frequency so that all postings are stored in term
dictionary?

In other words, instead of the goodness, I would like to know what is the
drawback of using pulsing postings format?

Thank you.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/4d4c5ad8-baab-4c81-85a5-dc75095a7f5a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAMUKNZXKEDQiBw%3DwmyLPh47bkJh%3DUT5E%2B_yrfS797vgCuADdBg%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


(Nik Everett) #3

If it doesn't have an advantage can it be deprecated? The whole codecs
page of the Elasticsearch guide seems like it could use a once over by
someone that really knows what they are doing. I.E. not me :slight_smile:
On Jul 13, 2014 10:41 PM, "Robert Muir" robert.muir@elasticsearch.com
wrote:

There is not really an advantage to it.

The optimization has been incorporated into the default index format
of lucene: https://issues.apache.org/jira/browse/LUCENE-4498

On Sun, Jul 13, 2014 at 10:20 PM, 陳智清 walker0902@gmail.com wrote:

From this article
(
http://blog.mikemccandless.com/2010/06/lucenes-pulsingcodec-on-primary-key.html
)
I know pulsing codec saves one disk seek hence introduces performance
gain.
I would like to know what I pay for using pulsing codec? What will
happen if
I give it a high cut-off frequency so that all postings are stored in
term
dictionary?

In other words, instead of the goodness, I would like to know what is the
drawback of using pulsing postings format?

Thank you.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit

https://groups.google.com/d/msgid/elasticsearch/4d4c5ad8-baab-4c81-85a5-dc75095a7f5a%40googlegroups.com
.

For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/CAMUKNZXKEDQiBw%3DwmyLPh47bkJh%3DUT5E%2B_yrfS797vgCuADdBg%40mail.gmail.com
.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAPmjWd3u-moKntRgxuUsD5x3rLpND%3DwFrttULA8CWcy2UTP%3Duw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.


(David Smith-2) #4

+1 on having more information on codecs/postings format. Including examples on where each makes sense and where they don't.

The current documentation on the codec module page in reference doesn't make the trade offs very clear. It's quite vague to somebody who has not used it (not complaining just a statement FYI) before. I think having examples where each makes sense and where they don't would make things much clearer.

Thank you.

On Jul 13, 2014, at 11:17 PM, Nikolas Everett nik9000@gmail.com wrote:

If it doesn't have an advantage can it be deprecated? The whole codecs page of the Elasticsearch guide seems like it could use a once over by someone that really knows what they are doing. I.E. not me :slight_smile:

On Jul 13, 2014 10:41 PM, "Robert Muir" robert.muir@elasticsearch.com wrote:
There is not really an advantage to it.

The optimization has been incorporated into the default index format
of lucene: https://issues.apache.org/jira/browse/LUCENE-4498

On Sun, Jul 13, 2014 at 10:20 PM, 陳智清 walker0902@gmail.com wrote:

From this article
(http://blog.mikemccandless.com/2010/06/lucenes-pulsingcodec-on-primary-key.html)
I know pulsing codec saves one disk seek hence introduces performance gain.
I would like to know what I pay for using pulsing codec? What will happen if
I give it a high cut-off frequency so that all postings are stored in term
dictionary?

In other words, instead of the goodness, I would like to know what is the
drawback of using pulsing postings format?

Thank you.

--
You received this message because you are subscribed to the Google Groups
"elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit
https://groups.google.com/d/msgid/elasticsearch/4d4c5ad8-baab-4c81-85a5-dc75095a7f5a%40googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAMUKNZXKEDQiBw%3DwmyLPh47bkJh%3DUT5E%2B_yrfS797vgCuADdBg%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/CAPmjWd3u-moKntRgxuUsD5x3rLpND%3DwFrttULA8CWcy2UTP%3Duw%40mail.gmail.com.
For more options, visit https://groups.google.com/d/optout.

--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
To view this discussion on the web visit https://groups.google.com/d/msgid/elasticsearch/828C16E1-6E9B-493E-B9FD-985531747D2D%40gmail.com.
For more options, visit https://groups.google.com/d/optout.


(system) #5