No Observable Difference Between BBQ and Default Configurations in Elasticsearch – Help with Index Size Comparison

Hi @BenTrent ,

thanks for sharing your insights and sorry for injecting questions/comments here.

  1. Keeping raw floating point values around _source seems to be inefficient approach, like discussed in Knn_vectors field understanding - Elastic Stack / Elasticsearch - Discuss the Elastic Stack, right?
  2. If we don't keep raw floating point values around _source, the values are still persist by underlaying Lucene, right? Would it be fine if we fetch values from Lucene for rescoring purpose?
  3. For re-quantizing and segment merging, may I ask whether we indeed need to keep the raw floating point values for any type of quantization? And how the re-quantization works? would be appreciated if you could share any resources.

Thanks a lot and looking forward for your reply.

Best,
Yakun