Hi, I am pretty new to elasticsearch, so I'm sorry if this is a trivial question. I am trying to figure out if it is possible to have a setup like this:
Machine A has a large corpus of documents. Machine B is running elasticsearch.
Machine A scans it's corpus of documents and runs the elasticsearch full-text indexing procedure on that content. Machine A sends the compact indexed version of the documents to Machine B. Machine B inserts the index provided by Machine A into it's elasticsearch instance. Machine B is now able to search for documents on Machine A and return IDs representing files from Machine A. Machine B has never seen the full content of Machine A's files.
By doing it this way I do not need to move the data around as much and I would probably save a lot of bandwidth and time.
Is this at all possible, or does Elasticsearch require the entire content to be on the same cluster/machine as is running the elasticsearch instance? I have not been able to find any information about this anywhere, possibly due to my lack of understanding.
If anyone could enlighten me on this subject that would be great!