Hello,
I'm using elasticsearch 0.90.2 with Elastica (PHP Client), and I am
experiencing some trouble indexing 10,000,000 documents using
elasticsearch.
I'm indexing using bulk queries only, by batch of 500.
The problem is that elasticsearch stop answering to elastica, every time at
the same stage : (Thu, 15 Aug 2013 14:55:15 BST Exporting results 29000 to
29500 to index), and this of course raises an exception
[Elastica\Exception\ClientException] -> No enabled connection, which is
accurate since as far as I know elastica uses XHTTP API to transfer the
documents (ie no socket or anything).
The funny thing is that if I try on a slower machine I manage to get to ~=
600,000 docs before having the same exception.
I've tried to add a sleep(1); between 2 bulks, which only earned me ~=4,000
additional documents indexed before crashing.
I've tried with all number of nodes, to change the Threadpool settings (in
the index and in the global config file, the doc is not too clear about
that), it changed nothing.
Nothing appear in Elasticsearch logs (it bounces without even noticing),
Elasticsearch never crashes (since it is available again after crashing my
indexing script without restarting or printing anything in the logs).
So right now, I can't even index the whole thing (whatever the time it
takes).
What can I do ?
Oh, and disabling auto-refresh didn't change much either.
Thanks,
Lucas
--
You received this message because you are subscribed to the Google Groups "elasticsearch" group.
To unsubscribe from this group and stop receiving emails from it, send an email to elasticsearch+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/groups/opt_out.