On Tue, 2011-06-28 at 10:44 -0700, mattx wrote:
> Wow. The all or nothing approach doesn't work for me. I need to be able to
> at least get back the document ID of the thing I am indexing. What I really
> need is the ability to throw away only certain fields in the _source.
> Am I being stupid? If I bulk index 100k email messages and don't include
> the _source then how can I later fetch these emails after doing a search?
> Do I have to store the IDs generated by the indexing operation as they map
> to my original IDs? I don't love that idea and I'm not even sure how to do
> that in a bulk indexing operation.
I don't follow what it is you are trying to do. Whether you index or
bulk_index you get back the ID (either the ID that you specify, or an
Why don't you want the _source? Because it contains too much
information? What about deleting the information that you don't want to
store before indexing the email?
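
As a rough sketch of that suggestion, assuming each email is a plain dict: drop the fields you don't want stored, then build the bulk request body with your own IDs so there is no generated ID to map back. The field names (`raw_headers`, `attachments`, `message_id`), the index name `emails`, and both helper functions are hypothetical, made up for illustration. Depending on the Elasticsearch version, the action line may also need a `_type`.

```python
import json

# Hypothetical field names; adjust to whatever your email documents contain.
UNWANTED_FIELDS = {"raw_headers", "attachments"}


def strip_fields(doc, unwanted=UNWANTED_FIELDS):
    """Return a copy of doc without the top-level fields listed in unwanted."""
    return {key: value for key, value in doc.items() if key not in unwanted}


def build_bulk_body(docs, index_name, id_field="message_id"):
    """Build a newline-delimited bulk request body that uses our own IDs."""
    lines = []
    for doc in docs:
        # Action line: tells the bulk API which index and ID to use.
        action = {"index": {"_index": index_name, "_id": doc[id_field]}}
        lines.append(json.dumps(action))
        # Source line: the document itself, minus the unwanted fields.
        lines.append(json.dumps(strip_fields(doc)))
    # The bulk body must end with a newline.
    return "\n".join(lines) + "\n"


emails = [
    {"message_id": "msg-1", "subject": "Hello", "body": "Hi there",
     "raw_headers": "...", "attachments": ["a.pdf"]},
    {"message_id": "msg-2", "subject": "Re: Hello", "body": "Hi back",
     "raw_headers": "..."},
]

body = build_bulk_body(emails, "emails")
print(body)
```

The resulting string is what you would send to the bulk endpoint with whatever HTTP client you already use. Because each document is indexed under its original ID, a search hit's `_id` points straight back at your own record.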
Perhaps a bit more context will help...