Hi
We recently enrolled about 300 fleet clients to our On-Prem ECE cluster, enrollment seems to be working fine but duplicates( marked as offline) are spawning en mass. Data are being ingested from machines with double entries and i can unenroll the duplicates based on time last seen. But why is this happening?
In addition to that, about half of the clients are "offline" at any given time with the following error code in the logs
[elastic_agent][error] Could not communicate with fleet-server Checking API will retry, error: status code: 429, fleet-server returned an error: MaxLimit, message: exceeded the max limit
I have followed the settings guide for the fleet server integration policy and adjusted it to accomodate 5000 client, with no change.
cache:
num_counters: 10000
max_cost: 10485760
server.limits:
policy_throttle: 100ms
checkin_limit:
interval: 10ms
burst: 2500
max: 2601
artifact_limit:
interval: 10ms
burst: 2500
max: 5000
ack_limit:
interval: 8ms
burst: 2500
max: 5000
enroll_limit:
interval: 40ms
burst: 250
max: 500
server.runtime:
gc_percent: 20
Any ideas?