Out of memory

TaeSung_Kim · June 7, 2017, 3:05am

elasticsearch 5.2.2 사용 중입니다.
갑작스레 노드가 죽어서 확인을 해보니 로그가 남아있지 않았습니다.

우분투에서 journalctl 명령어로 시스템에 남은 로그를 확인해보니 로그는 아래와 같았습니다.

Jun 06 12:29:26 hmxelk-03 kernel: python2 invoked oom-killer: gfp_mask=0x26000c0, order=2, oom_score_adj=0
Jun 06 12:29:26 hmxelk-03 kernel: Out of memory: Kill process 61808 (java) score 649 or sacrifice child**
Jun 06 12:29:26 hmxelk-03 kernel: Killed process 61808 (java) total-vm:384886604kB, anon-rss:8482112kB, file-rss:627280kB
Jun 06 12:29:26 hmxelk-03 kernel: Out of memory: Kill process 1452 (python3) score 2 or sacrifice child
Jun 06 12:29:26 hmxelk-03 kernel: Killed process 1452 (python3) total-vm:230192kB, anon-rss:28524kB, file-rss:3492kB
Jun 06 12:29:26 hmxelk-03 python3[1353]: 2017/06/06 12:29:26.238705 INFO Installed Agent WALinuxAgent-2.2.2 is the most current agent
Jun 06 12:29:26 hmxelk-03 python3[1353]: 2017/06/06 12:29:26.257563 INFO Agent WALinuxAgent-2.2.2 launched with command 'python3 -u /usr/sbin/waagent -run-exthandlers'
Jun 06 12:29:26 hmxelk-03 systemd[1]: elasticsearch.service: Main process exited, code=killed, status=9/KILL
Jun 06 12:29:26 hmxelk-03 systemd[1]: elasticsearch.service: Unit entered failed state.
Jun 06 12:29:26 hmxelk-03 systemd[1]: elasticsearch.service: Failed with result 'signal'.

MS Azure 가상머신에서 운영 중인데 Out Of Memory가 뜨면서 먼저 python2 가 죽고 이후 java 가 죽고,
python3 가 죽은 뒤 elasticsearch 노드가 죽었는데 혹시 관련해서 조언을 들을 수 있을지 궁금합니다.

기본적으로 elasticsearch의 설정은 아래와 같습니다.

/etc/default/elasticsearch

MAX_OPEN_FILES=65535
MAX_LOCKED_MEMORY=unlimited

/usr/lib/systemd/elasticsearch.service

LimitMEMLOCK=infinity

/etc/security/limits.conf

elasticsearch soft memlock unlimited
elasticsearch hard memlock unlimited

/etc/elasticsearch/elasticsearch.yml

bootstrap.memory_lock: true
indices.fielddata.cache.size: 75%
indices.memory.index_buffer_size: 10%

GET _cluster/settings

    "indices": {
      "breaker": {
        "fielddata": {
          "limit": "77%"
        },
        "request": {
          "limit": "50%"
        },
        "total": {
          "limit": "80%"
        }
      }
    }
  }

경험 있으신 분들이 계시다면 조언 부탁드립니다.
감사합니다.

Brad_Jungsu_Heo · June 7, 2017, 3:55am

oom killer 작동 시점에 ES가 물리램을 8.6GB 정도 사용 중이었군요. Azure 장비의 물리램을 몇 기가이고, ES의 Heap size는 몇 GB를 설정하셨는지요?

TaeSung_Kim · June 7, 2017, 3:59am

답변 주셔서 감사합니다.

TOTAL : 14GB
ES : 7GB

설정했습니다.

Brad_Jungsu_Heo · June 7, 2017, 4:20am

예. ES dedicated된 node라서 전체 RAM의 50%를 ES Heap에 할당하신 것 같군요.

python이랑 ES 다 합쳐도 10GB는 안 되는데, (페북 그룹에 다른 분이 댓글에서 언급하신 커널 버그가 아니라면) 어떤 process가 나머지 메모리를 사용 중이었는지 찾아보셔야겠군요.

큰 도움이 안 되서 죄송하네요 ^^;

TaeSung_Kim · June 7, 2017, 5:35am

도움이 안되다니요.
답변 주셔서 감사하죠.

글자수 제한 때문에 full log를 다 올린 게 아니라서요.
그래도 많이 배웠습니다.
감사합니다.

system · July 5, 2017, 5:36am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
My Elasticsearch Got Killed Frequently due to OOM killer with Out of memory message Elasticsearch elastic-stack-monitoring	7	275	May 29, 2024
Client Nodes being oom-killed Elasticsearch	17	2302	June 28, 2018
Elasticsearch killed by oom-killer Elasticsearch	6	770	July 3, 2023
Elasticsearch process is killed by OOM killer Elasticsearch	4	5266	March 3, 2020
Old 영역 메모리가 꾸준히 증가합니다 한국어 질문 및 토론	4	763	April 2, 2020

Out of memory

Related Topics