Elasticsearch river and mongodb replica

Robbie_Cheng · October 23, 2012, 5:03pm

According to the documentation for the Elasticsearch river for MongoDB, it
requires the oplog to pull data from mongo. We tried to run a replica
set with a single node (yes, cheating), as instructed at the following url:

How to get a MongoDB oplog without a full replica set | Loose XAML

It looks like the river pulls data from mongodb successfully, but we
would like to know more details. Since we only spotted 2 records in the
oplog, how does the river pull existing data from the replica set? Also, is
there any dependency between creating the replica set in mongodb and
hooking up ES and mongodb with the river? I mean, if we already have a
running replica set and hook it up with ES afterwards, what happens to
the existing data? Will it be pulled, too?

Thanks,

--

Richard_Louapre · November 5, 2012, 8:22am

Hi,

You can find more details on the oplog and internal replication in MongoDB
here [1].
This is how the river currently pulls data from mongo into ES.

[1] - Replication Internals – Kristina Chodorow's Blog
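
To make the oplog-based approach more concrete, here is a minimal Python sketch that replays oplog-style entries into an in-memory dictionary standing in for an ES index. It is not the river's actual code: `replay_oplog` and `sample_oplog` are hypothetical names, the entries use a simplified form of the classic oplog layout (`op`, `ns`, `o`, `o2`), and `ts` is a plain integer here rather than a BSON timestamp. The sketch does not model any initial import, so a consumer written this way would only see operations that are still recorded in the oplog.

```python
def replay_oplog(entries, index=None):
    """Apply oplog-style entries, in order, to an in-memory index.

    The index is a hypothetical stand-in for Elasticsearch:
    {namespace: {_id: document}}.
    """
    index = {} if index is None else index
    for entry in entries:
        op, ns = entry["op"], entry["ns"]
        docs = index.setdefault(ns, {})
        if op == "i":
            # Insert: "o" holds the full document.
            docs[entry["o"]["_id"]] = dict(entry["o"])
        elif op == "u":
            # Update: "o2" identifies the document, "o" describes the change.
            doc_id = entry["o2"]["_id"]
            change = entry["o"]
            if "$set" in change:
                docs.setdefault(doc_id, {"_id": doc_id}).update(change["$set"])
            else:
                # Assumed to be a full-document replacement.
                docs[doc_id] = {"_id": doc_id, **change}
        elif op == "d":
            # Delete: "o" holds the _id of the removed document.
            docs.pop(entry["o"]["_id"], None)
        # Other entry types (for example "n" no-ops) are ignored in this sketch.
    return index


# Hypothetical sample data, not taken from a real oplog.
sample_oplog = [
    {"ts": 1, "op": "i", "ns": "test.people", "o": {"_id": 1, "name": "Ann"}},
    {"ts": 2, "op": "u", "ns": "test.people", "o2": {"_id": 1},
     "o": {"$set": {"name": "Anna"}}},
    {"ts": 3, "op": "i", "ns": "test.people", "o": {"_id": 2, "name": "Bob"}},
    {"ts": 4, "op": "d", "ns": "test.people", "o": {"_id": 2}},
]

index = replay_oplog(sample_oplog)
print(index)  # {'test.people': {1: {'_id': 1, 'name': 'Anna'}}}
```

Any document written before the first entry in `sample_oplog` would never appear in `index`, which is why the question about existing data matters for a purely oplog-driven consumer.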

Thanks,
Richard.

On Tuesday, October 23, 2012 1:03:22 PM UTC-4, Robbie Cheng wrote:

According to the documentation for the Elasticsearch river for MongoDB, it
requires the oplog to pull data from mongo. We tried to run a replica
set with a single node (yes, cheating), as instructed at the following url:

How to get a MongoDB oplog without a full replica set | Loose XAML

It looks like the river pulls data from mongodb successfully, but we
would like to know more details. Since we only spotted 2 records in the
oplog, how does the river pull existing data from the replica set? Also, is
there any dependency between creating the replica set in mongodb and
hooking up ES and mongodb with the river? I mean, if we already have a
running replica set and hook it up with ES afterwards, what happens to
the existing data? Will it be pulled, too?

Thanks,

--