Hi @BenTrent ,
thanks for sharing your insights and sorry for injecting questions/comments here.
- Keeping raw floating point values around
_source
seems to be inefficient approach, like discussed in Knn_vectors field understanding - Elastic Stack / Elasticsearch - Discuss the Elastic Stack, right? - If we don't keep raw floating point values around
_source
, the values are still persist by underlaying Lucene, right? Would it be fine if we fetch values from Lucene for rescoring purpose? - For re-quantizing and segment merging, may I ask whether we indeed need to keep the raw floating point values for any type of quantization? And how the re-quantization works? would be appreciated if you could share any resources.
Thanks a lot and looking forward for your reply.
Best,
Yakun