I have a total of around 500,000 documents that I was previously inserting with bulk requests of 1,000 at a time.
Before I had an ingest plugin, I used an external library to transform the input and then collected the results for a bulk insertion.
Now that I have an ingest plugin for my purpose, how can I use it to transform documents in bulk and then send them for a bulk insertion?
I used this command to ingest a document using a pipeline I created:
curl -X PUT 'http://localhost:9200/test_index/review/1?pipeline=apply-vader-review' -H 'Content-Type: application/json' -d '{
  "content": "The plot was good, but the characters are uncompelling and the dialog is not great."
}'
The ingested document looks like this:
{
  "_index": "test_index",
  "_type": "review",
  "_id": "1",
  "_score": 1,
  "_source": {
    "content": "The plot was good, but the characters are uncompelling and the dialog is not great.",
    "polarity": {
      "negative": 0.327,
      "neutral": 0.579,
      "positive": 0.094,
      "compound": -0.7042
    }
  }
}
I see that I have to provide an _id in the request. How can I follow this format if I want the document id to be auto-generated during a bulk ingestion?
I figured it out. This is the _bulk request I tried, and it was successful.
curl -X POST 'http://localhost:9200/test_index/review/_bulk?pipeline=apply-vader-review' -H 'Content-Type: application/x-ndjson' -d '
{ "index" : { "_index" : "test_index", "_type" : "review" } }
{"content": "The plot was good, but the characters are uncompelling and the dialog is not great."}
{ "index" : { "_index" : "test_index", "_type" : "review" } }
{"content": "The plot was good, but the characters are uncompelling and the dialog is not great."}
{ "index" : { "_index" : "test_index", "_type" : "review" } }
{"content": "The plot was good, but the characters are uncompelling and the dialog is not great."}
'
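To scale this to the ~500,000 documents in batches of 1,000, the same pattern can be scripted. The sketch below (assumptions: index `test_index`, type `review`, pipeline `apply-vader-review`, and the hypothetical helper names `build_bulk_body`/`chunked`) builds NDJSON `_bulk` payloads with no `_id` in the action lines, so Elasticsearch auto-generates the ids:

```python
import json

def build_bulk_body(docs, index="test_index", doc_type="review"):
    """Build an NDJSON _bulk payload; omitting "_id" in each action
    line makes Elasticsearch auto-generate the document ids."""
    lines = []
    for doc in docs:
        lines.append(json.dumps({"index": {"_index": index, "_type": doc_type}}))
        lines.append(json.dumps(doc))
    # The _bulk API requires a trailing newline after the last line.
    return "\n".join(lines) + "\n"

def chunked(seq, size=1000):
    """Yield successive batches of `size` documents."""
    for i in range(0, len(seq), size):
        yield seq[i:i + size]

# Example: two documents, batched and rendered as one bulk payload.
# Each payload would then be POSTed to
# http://localhost:9200/_bulk?pipeline=apply-vader-review
# with the Content-Type: application/x-ndjson header.
docs = [
    {"content": "The plot was good, but the characters are uncompelling."},
    {"content": "The dialog is not great."},
]
for batch in chunked(docs, size=1000):
    payload = build_bulk_body(batch)
```

Since the pipeline is given as a URL query parameter, every document in the batch goes through the same transform; only the batching and payload formatting need to live in client code.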