_http_request_failure in logstash while using http_poller


(Navneet Mathpal) #1

Hi ,

I am getting some json documents from http_poller

input {
  http_poller {
    urls => {
      "localhost" => "http://localhost:9200"
    }
    interval => 10
  }
}
After it reads all the documents, it shows _http_request_failure.

1. Is it because Logstash pings the URL every 10 seconds, and if it does not find any new document there it shows this _http_request_failure?
2. If a new doc gets added at the URL, will http_poller be able to pick it up in real time, without re-reading the older docs?

Thanks


(Magnus Bäck) #2

Not sure what you mean by older and newer docs. http://localhost:9200 will only return Elasticsearch's rarely-changing status document and the http_poller just makes an HTTP request and passes the results to Logstash.
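For reference, a complete minimal pipeline looks roughly like this (the URL, interval, and codec are just placeholders, and newer plugin versions use a schedule option instead of interval):

```
input {
  http_poller {
    urls => {
      "status" => "http://localhost:9200"
    }
    # Poll every 10 seconds; newer versions use
    # schedule => { every => "10s" } instead.
    interval => 10
    codec => "json"
  }
}
output {
  stdout { codec => rubydebug }
}
```

Every poll fetches the full response again; the plugin does not diff responses between polls.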


(Navneet Mathpal) #3

I mean, if I have 200 JSON docs available at my URL "www.example.com/jsonfile", then after running http_poller I will get 200 JSON docs. If one new doc gets added, will http_poller take only that new doc, or will it fetch all 201 docs again?

Why does this _http_request_failure come?


(Magnus Bäck) #4

Okay, so http://localhost:9200 was just a randomly picked URL? That was not obvious.

After running http_poller I will get 200 JSON docs. If one new doc gets added, will http_poller take only that new doc, or will it fetch all 201 docs again?

http_poller does not maintain any state, i.e. it has no idea which documents are new.

Why does this _http_request_failure come?

The resulting event's @metadata field (normally not emitted by outputs but available with e.g. stdout { codec => rubydebug }) should contain details about the failure.
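For example, a debug output like this makes @metadata visible (the exact layout of the failure details depends on the plugin version):

```
output {
  stdout {
    # Without metadata => true, the @metadata field is
    # hidden from the rubydebug output.
    codec => rubydebug { metadata => true }
  }
}
```

Run the pipeline with this output and inspect the event printed for a failed poll; the error message and request details should be in there.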


(Navneet Mathpal) #5

Thank you @magnusbaeck

So we can handle the repeated docs using document_id.
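Something like this is what I mean (a sketch; the source field and index name are guesses, and older fingerprint plugin versions may require a key option for SHA methods):

```
filter {
  # Derive a stable ID from the document content so re-polled
  # duplicates overwrite themselves instead of piling up.
  fingerprint {
    source => ["message"]
    method => "SHA256"
    target => "[@metadata][fingerprint]"
  }
}
output {
  elasticsearch {
    hosts => ["localhost:9200"]
    index => "polled-docs"
    document_id => "%{[@metadata][fingerprint]}"
  }
}
```

Re-indexing the same 200 docs then just overwrites the same 200 IDs instead of creating duplicates.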

1. If I have 1 million JSON records available at my URL, and the URL is polled every 10 seconds (and let us suppose http_poller reads 100 docs/sec, so one full pass takes about 10,000 seconds), does that mean http_poller can never read the whole file?


(Magnus Bäck) #6

Your architecture doesn't sound very sustainable, at least not with the update frequency you have in mind. Those 1 million documents must weigh, what, 100 MB or more? You really want some kind of incremental polling ("give me everything that's been updated since X"), or to have the origin system send updates to a broker that you can read from.
