Although I have read a lot of guides and articles, there seems still
something I missed in understanding the process of document indexing and
searching. I would like to know if there are any comprehensive documents
which describe the process very detailly?
For example, the document indexing process involves document preprocessing,
text analyzing, load into memory buffer (in the mean time also write this
operation into translog), write out as segment files, ..., etc. The
document indexing operation in Elasticsearch is actually quite complicated,
but the articles I can found on the Internet always only cover some of
these steps.
If you have that basic knowledge, perhaps the next steps would be read into
the code, for instance, you mentioned about translog, so find for the
translog class file and start from there?
Although I have read a lot of guides and articles, there seems still
something I missed in understanding the process of document indexing and
searching. I would like to know if there are any comprehensive documents
which describe the process very detailly?
For example, the document indexing process involves document
preprocessing, text analyzing, load into memory buffer (in the mean time
also write this operation into translog), write out as segment files, ...,
etc. The document indexing operation in Elasticsearch is actually quite
complicated, but the articles I can found on the Internet always only cover
some of these steps.
If you have that basic knowledge, perhaps the next steps would be read
into the code, for instance, you mentioned about translog, so find for the
translog class file and start from there?
Although I have read a lot of guides and articles, there seems still
something I missed in understanding the process of document indexing and
searching. I would like to know if there are any comprehensive documents
which describe the process very detailly?
For example, the document indexing process involves document
preprocessing, text analyzing, load into memory buffer (in the mean time
also write this operation into translog), write out as segment files, ...,
etc. The document indexing operation in Elasticsearch is actually quite
complicated, but the articles I can found on the Internet always only cover
some of these steps.
If you have that basic knowledge, perhaps the next steps would be read
into the code, for instance, you mentioned about translog, so find for the
translog class file and start from there?
Although I have read a lot of guides and articles, there seems still
something I missed in understanding the process of document indexing and
searching. I would like to know if there are any comprehensive documents
which describe the process very detailly?
For example, the document indexing process involves document
preprocessing, text analyzing, load into memory buffer (in the mean time
also write this operation into translog), write out as segment files, ...,
etc. The document indexing operation in Elasticsearch is actually quite
complicated, but the articles I can found on the Internet always only cover
some of these steps.
Apache, Apache Lucene, Apache Hadoop, Hadoop, HDFS and the yellow elephant
logo are trademarks of the
Apache Software Foundation
in the United States and/or other countries.