It sounds a little like the APM Server can't keep up with the volume of data that's being sent to it. Do you know approximately how many HTTP requests and events it's receiving, say per second or per minute?
If that's the issue, the recommended solution is to spin up multiple APM Servers behind a load balancer.
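
To make the idea concrete, here is a minimal sketch of how a round-robin load balancer would spread incoming agent requests across several APM Server instances. The hostnames are hypothetical placeholders, and the port assumes APM Server's default of 8200. This only simulates the distribution; it is not a real proxy.

```python
from itertools import cycle

# Hypothetical backend addresses (assumption); 8200 is APM Server's default port.
APM_SERVERS = [
    "http://apm-1.example.internal:8200",
    "http://apm-2.example.internal:8200",
    "http://apm-3.example.internal:8200",
]


def round_robin(backends):
    """Return an endless iterator that yields the backends in rotation."""
    return cycle(backends)


def distribute(n_requests, backends):
    """Count how many of n_requests each backend would receive under round-robin."""
    picker = round_robin(backends)
    counts = {backend: 0 for backend in backends}
    for _ in range(n_requests):
        counts[next(picker)] += 1
    return counts


if __name__ == "__main__":
    # Each server ends up handling roughly a third of the traffic.
    for backend, count in distribute(9000, APM_SERVERS).items():
        print(backend, count)
```

In practice you would not write this yourself: an off-the-shelf load balancer (nginx, HAProxy, or your cloud provider's) does the rotation, and the agents are pointed at the balancer's address instead of at an individual APM Server.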