Anybody know that index-sorting data is physically or logically sorted on disk?
here is my question: if there are existing sorted data, when inserting a new record into the index, how will the new index-sorting data be organized? physically re-sort or logically re-sort ? and why?
New segments are created based on data in the transaction log. Once they have been created they do not change. Periodically segments are merged, which results in new larger segments being created before the old ones that were merged are deleted.
so, it is like once refresh was triggered, new segments would be created from the transaction log, the index-sorting data would be created during that process, except that when segment merging happens, the index-sorting data would be merged again too.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.