Zlib 1.2.12 getting corruption errors

mailme_gx · April 5, 2022, 5:26am

Hi all

I filed a bug several days ago but have had no response, has no one else had issues after upgrading zlib to 1.2.12?

github.com/elastic/elasticsearch

zlib 1.2.12 getting corruption errors

opened 12:50PM - 31 Mar 22 UTC

closed 08:38AM - 13 Apr 22 UTC

mailme-gx

>bug feedback_needed :Core/Infra/Core Team:Core/Infra

running ES 7.1.2 on archlinux, after zlib was upgraded from 1.2.11 to 1.2.12 the… service did not start Taking this opportunity to upgrade to the latest elasticsearch I installed ES 8.1.0 single node with no existing data and got the same issue, after downgrading zlib both versions of ES work fine sample stack trace ``` {"@timestamp":"2022-03-31T04:58:24.686Z", "log.level": "WARN", "message":"failing [elected-as-master ([1] nodes joined)[{gxdev1}{ntQC1xXORxaS-X7rjU0w-A}{Hye6nHtRT7iZ4qxd9FGUeg}{127.0.0.1}{127.0.0.1:9300}{cdfhilmrstw} completing election, _BECOME_MASTER_TASK_, _FINISH_ELECTION_]]: failed to commit cluster state version [79]", "ecs.version": "1.2.0","service.name":"ES_ECS","event.dataset":"elasticsearch.server","process.thread.name":"elasticsearch[gxdev1][masterService#upda teTask][T#1]","log.logger":"org.elasticsearch.cluster.service.MasterService","elasticsearch.cluster.uuid":"55_PjKTLS5-yDT-K-pkh6w","elasticsearch.node.id":"ntQC1xXORxaS-X7rjU0w-A","elasticsearch.node.name":"gxdev1","elasticsearch.cluster. name":"elasticsearch","error.type":"org.elasticsearch.cluster.coordination.FailedToCommitClusterStateException","error.message":"publication failed","error.stack_trace":"org.elasticsearch.cluster.coordination.FailedToCommitClusterStateExc eption: publication failed\n\tat org.elasticsearch.cluster.coordination.Coordinator$CoordinatorPublication$4.onFailure(Coordinator.java:1718)\n\tat org.elasticsearch.common.util.concurrent.ListenableFuture.notifyListenerDirectly(Listenabl eFuture.java:115)\n\tat org.elasticsearch.common.util.concurrent.ListenableFuture.addListener(ListenableFuture.java:55)\n\tat org.elasticsearch.cluster.coordination.Coordinator$CoordinatorPublication.onCompletion(Coordinator.java:1625)\n\ tat org.elasticsearch.cluster.coordination.Publication.onPossibleCompletion(Publication.java:114)\n\tat org.elasticsearch.cluster.coordination.Publication.onPossibleCommitFailure(Publication.java:165)\n\tat org.elasticsearch.cluster.coord ination.Publication$PublicationTarget$PublishResponseHandler.onFailure(Publication.java:376)\n\tat org.elasticsearch.cluster.coordination.Coordinator$4.onFailure(Coordinator.java:1371)\n\tat org.elasticsearch.cluster.coordination.Publicat ionTransportHandler$PublicationContext$1.onFailure(PublicationTransportHandler.java:360)\n\tat org.elasticsearch.cluster.coordination.PublicationTransportHandler$PublicationContext.lambda$sendClusterStateDiff$7(PublicationTransportHandler .java:438)\n\tat org.elasticsearch.action.ActionListener$DelegatingActionListener.onFailure(ActionListener.java:192)\n\tat org.elasticsearch.action.ActionListener$Delegating.onFailure(ActionListener.java:66)\n\tat org.elasticsearch.action .ActionListener$RunAfterActionListener.onFailure(ActionListener.java:350)\n\tat org.elasticsearch.action.ActionListener$Delegating.onFailure(ActionListener.java:66)\n\tat org.elasticsearch.action.ActionListener$RunAfterActionListener.onFa ilure(ActionListener.java:350)\n\tat org.elasticsearch.action.ActionListenerResponseHandler.handleException(ActionListenerResponseHandler.java:48)\n\tat org.elasticsearch.transport.TransportService$ContextRestoreResponseHandler.handleExce ption(TransportService.java:1349)\n\tat org.elasticsearch.transport.TransportService$DirectResponseChannel.processException(TransportService.java:1458)\n\tat org.elasticsearch.transport.TransportService$DirectResponseChannel$2.run(Transpo rtService.java:1437)\n\tat org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingRunnable.run(ThreadContext.java:717)\n\tat java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)\n\tat java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)\n\tat java.base/java.lang.Thread.run(Thread.java:833)\nCaused by: org.elasticsearch.common.util.concurrent.UncategorizedExecutionException: Failed e xecution\n\tat org.elasticsearch.common.util.concurrent.FutureUtils.rethrowExecutionException(FutureUtils.java:80)\n\tat org.elasticsearch.common.util.concurrent.FutureUtils.get(FutureUtils.java:72)\n\tat org.elasticsearch.common.util.con current.ListenableFuture.notifyListenerDirectly(ListenableFuture.java:112)\n\t... 21 more\nCaused by: java.util.concurrent.ExecutionException: org.apache.lucene.index.CorruptIndexException: checksum failed (hardware problem?) : expected=2 e603023 actual=f0db10c0 (resource=BufferedChecksumIndexInput(NIOFSIndexInput(path=\"/mq_cluster/data/elasticsearch/_state/_9b.fdt\")))\n\tat org.elasticsearch.common.util.concurrent.BaseFuture$Sync.getValue(BaseFuture.java:257)\n\tat org. elasticsearch.common.util.concurrent.BaseFuture$Sync.get(BaseFuture.java:231)\n\tat org.elasticsearch.common.util.concurrent.BaseFuture.get(BaseFuture.java:53)\n\tat org.elasticsearch.common.util.concurrent.FutureUtils.get(FutureUtils.jav a:65)\n\t... 22 more\nCaused by: org.apache.lucene.index.CorruptIndexException: checksum failed (hardware problem?) : expected=2e603023 actual=f0db10c0 (resource=BufferedChecksumIndexInput(NIOFSIndexInput(path=\"/mq_cluster/data/elasticse arch/_state/_9b.fdt\")))\n\tat org.apache.lucene.codecs.CodecUtil.checkFooter(CodecUtil.java:440)\n\tat org.apache.lucene.codecs.lucene90.Lucene90CompoundFormat.writeCompoundFile(Lucene90CompoundFormat.java:123)\n\tat org.apache.lucene.co decs.lucene90.Lucene90CompoundFormat.write(Lucene90CompoundFormat.java:98)\n\tat org.apache.lucene.index.IndexWriter.createCompoundFile(IndexWriter.java:5563)\n\tat org.apache.lucene.index.DocumentsWriterPerThread.sealFlushedSegment(Docum entsWriterPerThread.java:537)\n\tat org.apache.lucene.index.DocumentsWriterPerThread.flush(DocumentsWriterPerThread.java:468)\n\tat org.apache.lucene.index.DocumentsWriter.doFlush(DocumentsWriter.java:497)\n\tat org.apache.lucene.index.Do cumentsWriter.flushAllThreads(DocumentsWriter.java:676)\n\tat org.apache.lucene.index.IndexWriter.doFlush(IndexWriter.java:4014)\n\tat org.apache.lucene.index.IndexWriter.flush(IndexWriter.java:3988)\n\tat org.apache.lucene.index.IndexWri ter.flush(IndexWriter.java:3967)\n\tat org.elasticsearch.gateway.PersistedClusterStateService$MetadataIndexWriter.flush(PersistedClusterStateService.java:692)\n\tat org.elasticsearch.gateway.PersistedClusterStateService$Writer.addMetadata (PersistedClusterStateService.java:991)\n\tat org.elasticsearch.gateway.PersistedClusterStateService$Writer.overwriteMetadata(PersistedClusterStateService.java:975)\n\tat org.elasticsearch.gateway.PersistedClusterStateService$Writer.write FullStateAndCommit(PersistedClusterStateService.java:788)\n\tat org.elasticsearch.gateway.GatewayMetaState$LucenePersistedState.setLastAcceptedState(GatewayMetaState.java:504)\n\tat org.elasticsearch.cluster.coordination.CoordinationState .handlePublishRequest(CoordinationState.java:392)\n\tat org.elasticsearch.cluster.coordination.Coordinator.handlePublishRequest(Coordinator.java:418)\n\tat org.elasticsearch.cluster.coordination.PublicationTransportHandler.acceptState(Pub licationTransportHandler.java:200)\n\tat org.elasticsearch.cluster.coordination.PublicationTransportHandler.handleIncomingPublishRequest(PublicationTransportHandler.java:183)\n\tat org.elasticsearch.cluster.coordination.PublicationTranspo rtHandler.lambda$new$0(PublicationTransportHandler.java:103)\n\tat org.elasticsearch.transport.RequestHandlerRegistry.processMessageReceived(RequestHandlerRegistry.java:67)\n\tat org.elasticsearch.transport.TransportService$6.doRun(Transp ortService.java:917)\n\tat org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingAbstractRunnable.doRun(ThreadContext.java:776)\n\tat org.elasticsearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:26 )\n\t... 3 more\n"} ``` ``` org.elasticsearch.cluster.coordination.FailedToCommitClusterStateException: publication failed at org.elasticsearch.cluster.coordination.Coordinator$CoordinatorPublication$4.onFailure(Coordinator.java:1718) at org.elasticsearch.common.util.concurrent.ListenableFuture.notifyListenerDirectly(ListenableFuture.java:115) at org.elasticsearch.common.util.concurrent.ListenableFuture.addListener(ListenableFuture.java:55) at org.elasticsearch.cluster.coordination.Coordinator$CoordinatorPublication.onCompletion(Coordinator.java:1625) at org.elasticsearch.cluster.coordination.Publication.onPossibleCompletion(Publication.java:114) at org.elasticsearch.cluster.coordination.Publication.onPossibleCommitFailure(Publication.java:165) at org.elasticsearch.cluster.coordination.Publication$PublicationTarget$PublishResponseHandler.onFailure(Publication.java:376) at org.elasticsearch.cluster.coordination.Coordinator$4.onFailure(Coordinator.java:1371) at org.elasticsearch.cluster.coordination.PublicationTransportHandler$PublicationContext$1.onFailure(PublicationTransportHandler.java:360) at org.elasticsearch.cluster.coordination.PublicationTransportHandler$PublicationContext.lambda$sendClusterStateDiff$7(PublicationTransportHandler.java:438) at org.elasticsearch.action.ActionListener$DelegatingActionListener.onFailure(ActionListener.java:192) at org.elasticsearch.action.ActionListener$Delegating.onFailure(ActionListener.java:66) at org.elasticsearch.action.ActionListener$RunAfterActionListener.onFailure(ActionListener.java:350) at org.elasticsearch.action.ActionListener$Delegating.onFailure(ActionListener.java:66) at org.elasticsearch.action.ActionListener$RunAfterActionListener.onFailure(ActionListener.java:350) at org.elasticsearch.action.ActionListenerResponseHandler.handleException(ActionListenerResponseHandler.java:48) at org.elasticsearch.transport.TransportService$ContextRestoreResponseHandler.handleException(TransportService.java:1349) at org.elasticsearch.transport.TransportService$DirectResponseChannel.processException(TransportService.java:1458) at org.elasticsearch.transport.TransportService$DirectResponseChannel$2.run(TransportService.java:1437) at org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingRunnable.run(ThreadContext.java:717) at java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136) atjava.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635) at java.base/java.lang.Thread.run(Thread.java:833) Caused by: org.elasticsearch.common.util.concurrent.UncategorizedExecutionException: Failed execution at org.elasticsearch.common.util.concurrent.FutureUtils.rethrowExecutionException(FutureUtils.java:80) at org.elasticsearch.common.util.concurrent.FutureUtils.get(FutureUtils.java:72) at org.elasticsearch.common.util.concurrent.ListenableFuture.notifyListenerDirectly(ListenableFuture.java:112) ... 21 more Caused by: java.util.concurrent.ExecutionException: org.apache.lucene.index.CorruptIndexException: checksum failed (hardware problem?) : expected=2e603023 actual=f0db10c0 (resource=BufferedChecksumIndexInput(NIOFSIndexInput(path="/mq_cluster/data/elasticsearch/_state/_9b.fdt"))) at org.elasticsearch.common.util.concurrent.BaseFuture$Sync.getValue(BaseFuture.java:257) at org.elasticsearch.common.util.concurrent.BaseFuture$Sync.get(BaseFuture.java:231) at org.elasticsearch.common.util.concurrent.BaseFuture.get(BaseFuture.java:53) at org.elasticsearch.common.util.concurrent.FutureUtils.get(FutureUtils.java:65) ... 22 more Caused by: org.apache.lucene.index.CorruptIndexException: checksum failed (hardware problem?) : expected=2e603023 actual=f0db10c0 (resource=BufferedChecksumIndexInput(NIOFSIndexInput(path="/mq_cluster/data/elasticsearch/_state/_9b.fdt"))) at org.apache.lucene.codecs.CodecUtil.checkFooter(CodecUtil.java:440) at org.apache.lucene.codecs.lucene90.Lucene90CompoundFormat.writeCompoundFile(Lucene90CompoundFormat.java:123) at org.apache.lucene.codecs.lucene90.Lucene90CompoundFormat.write(Lucene90CompoundFormat.java:98) at org.apache.lucene.index.IndexWriter.createCompoundFile(IndexWriter.java:5563) at org.apache.lucene.index.DocumentsWriterPerThread.sealFlushedSegment(DocumentsWriterPerThread.java:537) at org.apache.lucene.index.DocumentsWriterPerThread.flush(DocumentsWriterPerThread.java:468) at org.apache.lucene.index.DocumentsWriter.doFlush(DocumentsWriter.java:497) at org.apache.lucene.index.DocumentsWriter.flushAllThreads(DocumentsWriter.java:676) at org.apache.lucene.index.IndexWriter.doFlush(IndexWriter.java:4014) at org.apache.lucene.index.IndexWriter.flush(IndexWriter.java:3988) at org.apache.lucene.index.IndexWriter.flush(IndexWriter.java:3967) at org.elasticsearch.gateway.PersistedClusterStateService$MetadataIndexWriter.flush(PersistedClusterStateService.java:692) at org.elasticsearch.gateway.PersistedClusterStateService$Writer.addMetadata(PersistedClusterStateService.java:991) at org.elasticsearch.gateway.PersistedClusterStateService$Writer.overwriteMetadata(PersistedClusterStateService.java:975) at org.elasticsearch.gateway.PersistedClusterStateService$Writer.writeFullStateAndCommit(PersistedClusterStateService.java:788) at org.elasticsearch.gateway.GatewayMetaState$LucenePersistedState.setLastAcceptedState(GatewayMetaState.java:504) at org.elasticsearch.cluster.coordination.CoordinationState.handlePublishRequest(CoordinationState.java:392) at org.elasticsearch.cluster.coordination.Coordinator.handlePublishRequest(Coordinator.java:418) at org.elasticsearch.cluster.coordination.PublicationTransportHandler.acceptState(PublicationTransportHandler.java:200) at org.elasticsearch.cluster.coordination.PublicationTransportHandler.handleIncomingPublishRequest(PublicationTransportHandler.java:183) at org.elasticsearch.cluster.coordination.PublicationTransportHandler.lambda$new$0(PublicationTransportHandler.java:103) at org.elasticsearch.transport.RequestHandlerRegistry.processMessageReceived(RequestHandlerRegistry.java:67) at org.elasticsearch.transport.TransportService$6.doRun(TransportService.java:917) at org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingAbstractRunnable.doRun(ThreadContext.java:776) at org.elasticsearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:26 ... 3 more ```

I tried this on a new clean data directory with a single node, if I downgrade to 1.2.11 the service starts fine and I can create indexes and index documents, when I use zlib 1.2.12 the service fails with errors

/usr/share/elasticsearch/logs/elasticsearch.log

[2022-04-05T05:06:08,301][ERROR][o.e.b.ElasticsearchUncaughtExceptionHandler] [gxdev1] uncaught exception in thread [main]
org.elasticsearch.bootstrap.StartupException: org.elasticsearch.ElasticsearchException: failed to load metadata
	at org.elasticsearch.bootstrap.Elasticsearch.init(Elasticsearch.java:170) ~[elasticsearch-8.1.0.jar:8.1.0]
	at org.elasticsearch.bootstrap.Elasticsearch.execute(Elasticsearch.java:157) ~[elasticsearch-8.1.0.jar:8.1.0]
	at org.elasticsearch.common.cli.EnvironmentAwareCommand.execute(EnvironmentAwareCommand.java:81) ~[elasticsearch-8.1.0.jar:8.1.0]
	at org.elasticsearch.cli.Command.mainWithoutErrorHandling(Command.java:112) ~[elasticsearch-cli-8.1.0.jar:8.1.0]
	at org.elasticsearch.cli.Command.main(Command.java:77) ~[elasticsearch-cli-8.1.0.jar:8.1.0]
	at org.elasticsearch.bootstrap.Elasticsearch.main(Elasticsearch.java:122) ~[elasticsearch-8.1.0.jar:8.1.0]
	at org.elasticsearch.bootstrap.Elasticsearch.main(Elasticsearch.java:80) ~[elasticsearch-8.1.0.jar:8.1.0]
Caused by: org.elasticsearch.ElasticsearchException: failed to load metadata
	at org.elasticsearch.gateway.GatewayMetaState.start(GatewayMetaState.java:162) ~[elasticsearch-8.1.0.jar:8.1.0]
	at org.elasticsearch.node.Node.start(Node.java:1142) ~[elasticsearch-8.1.0.jar:8.1.0]
	at org.elasticsearch.bootstrap.Bootstrap.start(Bootstrap.java:272) ~[elasticsearch-8.1.0.jar:8.1.0]
	at org.elasticsearch.bootstrap.Bootstrap.init(Bootstrap.java:367) ~[elasticsearch-8.1.0.jar:8.1.0]
	at org.elasticsearch.bootstrap.Elasticsearch.init(Elasticsearch.java:166) ~[elasticsearch-8.1.0.jar:8.1.0]
	... 6 more
Caused by: org.apache.lucene.index.CorruptIndexException: checksum failed (hardware problem?) : expected=226868ae actual=fcd3484d (resource=BufferedChecksumIndexInput(NIOFSIndexInput(path="/mq_cluster/data/elasticsearch/_state/_9c.fdt"))
	at org.apache.lucene.codecs.CodecUtil.checkFooter(CodecUtil.java:440) ~[lucene-core-9.0.0.jar:9.0.0 0b18b3b965cedaf5eb129aa41243a44c83ca826d - jpountz - 2021-12-01 14:23:49]
	at org.apache.lucene.codecs.lucene90.Lucene90CompoundFormat.writeCompoundFile(Lucene90CompoundFormat.java:123) ~[lucene-core-9.0.0.jar:9.0.0 0b18b3b965cedaf5eb129aa41243a44c83ca826d - jpountz - 2021-12-01 14:23:49]
	at org.apache.lucene.codecs.lucene90.Lucene90CompoundFormat.write(Lucene90CompoundFormat.java:98) ~[lucene-core-9.0.0.jar:9.0.0 0b18b3b965cedaf5eb129aa41243a44c83ca826d - jpountz - 2021-12-01 14:23:49]
	at org.apache.lucene.index.IndexWriter.createCompoundFile(IndexWriter.java:5563) ~[lucene-core-9.0.0.jar:9.0.0 0b18b3b965cedaf5eb129aa41243a44c83ca826d - jpountz - 2021-12-01 14:23:49]
	at org.apache.lucene.index.DocumentsWriterPerThread.sealFlushedSegment(DocumentsWriterPerThread.java:537) ~[lucene-core-9.0.0.jar:9.0.0 0b18b3b965cedaf5eb129aa41243a44c83ca826d - jpountz - 2021-12-01 14:23:49]
	at org.apache.lucene.index.DocumentsWriterPerThread.flush(DocumentsWriterPerThread.java:468) ~[lucene-core-9.0.0.jar:9.0.0 0b18b3b965cedaf5eb129aa41243a44c83ca826d - jpountz - 2021-12-01 14:23:49]
	at org.apache.lucene.index.DocumentsWriter.doFlush(DocumentsWriter.java:497) ~[lucene-core-9.0.0.jar:9.0.0 0b18b3b965cedaf5eb129aa41243a44c83ca826d - jpountz - 2021-12-01 14:23:49]
	at org.apache.lucene.index.DocumentsWriter.flushAllThreads(DocumentsWriter.java:676) ~[lucene-core-9.0.0.jar:9.0.0 0b18b3b965cedaf5eb129aa41243a44c83ca826d - jpountz - 2021-12-01 14:23:49]
	at org.apache.lucene.index.IndexWriter.doFlush(IndexWriter.java:4014) ~[lucene-core-9.0.0.jar:9.0.0 0b18b3b965cedaf5eb129aa41243a44c83ca826d - jpountz - 2021-12-01 14:23:49]
	at org.apache.lucene.index.IndexWriter.flush(IndexWriter.java:3988) ~[lucene-core-9.0.0.jar:9.0.0 0b18b3b965cedaf5eb129aa41243a44c83ca826d - jpountz - 2021-12-01 14:23:49]
	at org.apache.lucene.index.IndexWriter.flush(IndexWriter.java:3967) ~[lucene-core-9.0.0.jar:9.0.0 0b18b3b965cedaf5eb129aa41243a44c83ca826d - jpountz - 2021-12-01 14:23:49]
	at org.elasticsearch.gateway.PersistedClusterStateService$MetadataIndexWriter.flush(PersistedClusterStateService.java:692) ~[elasticsearch-8.1.0.jar:8.1.0]
	at org.elasticsearch.gateway.PersistedClusterStateService$Writer.addMetadata(PersistedClusterStateService.java:991) ~[elasticsearch-8.1.0.jar:8.1.0]
	at org.elasticsearch.gateway.PersistedClusterStateService$Writer.overwriteMetadata(PersistedClusterStateService.java:975) ~[elasticsearch-8.1.0.jar:8.1.0]
	at org.elasticsearch.gateway.PersistedClusterStateService$Writer.writeFullStateAndCommit(PersistedClusterStateService.java:788) ~[elasticsearch-8.1.0.jar:8.1.0]
	at org.elasticsearch.gateway.GatewayMetaState$LucenePersistedState.<init>(GatewayMetaState.java:450) ~[elasticsearch-8.1.0.jar:8.1.0]
	at org.elasticsearch.gateway.GatewayMetaState.start(GatewayMetaState.java:131) ~[elasticsearch-8.1.0.jar:8.1.0]
	at org.elasticsearch.node.Node.start(Node.java:1142) ~[elasticsearch-8.1.0.jar:8.1.0]
	at org.elasticsearch.bootstrap.Bootstrap.start(Bootstrap.java:272) ~[elasticsearch-8.1.0.jar:8.1.0]
	at org.elasticsearch.bootstrap.Bootstrap.init(Bootstrap.java:367) ~[elasticsearch-8.1.0.jar:8.1.0]
	at org.elasticsearch.bootstrap.Elasticsearch.init(Elasticsearch.java:166) ~[elasticsearch-8.1.0.jar:8.1.0]
	... 6 more
[2022-04-05T05:06:08,309][INFO ][o.e.n.Node               ] [gxdev1] stopping ...
[2022-04-05T05:06:08,353][INFO ][o.e.n.Node               ] [gxdev1] stopped
[2022-04-05T05:06:08,354][INFO ][o.e.n.Node               ] [gxdev1] closing ...
[2022-04-05T05:06:08,369][INFO ][o.e.n.Node               ] [gxdev1] closed
[2022-04-05T05:06:08,371][INFO ][o.e.x.m.p.NativeController] [gxdev1] Native controller process has stopped - no new native processes can be started

warkolm · April 5, 2022, 5:57am

7.1 is EOL and no longer supported. It's likely your issue will be closed, directing you to upgrade to a supported version - 7.10.x or above.

DavidTurner · April 5, 2022, 8:25am

While it's true that they should definitely move away from 7.1.2, and also Arch isn't supported, the OP indicates that this is a problem in 8.1.0 too. Latest zlib does include some changes in how CRCs are calculated which could be having an impact here, although I haven't been able to reproduce the failure myself.

warkolm · April 5, 2022, 8:32am

Ahh I missed that in the logs.

mailme_gx · April 6, 2022, 1:42am

this was resolved with help from bug report on github DaveCTurner pointed out it is a CPU specific issue, in proxmox change cpu (from kvm64) to Haswell

github.com/elastic/elasticsearch

zlib 1.2.12 getting corruption errors

opened 12:50PM - 31 Mar 22 UTC

mailme-gx

running ES 7.1.2 on archlinux, after zlib was upgraded from 1.2.11 to 1.2.12 the… service did not start Taking this opportunity to upgrade to the latest elasticsearch I installed ES 8.1.0 single node with no existing data and got the same issue, after downgrading zlib both versions of ES work fine sample stack trace ``` {"@timestamp":"2022-03-31T04:58:24.686Z", "log.level": "WARN", "message":"failing [elected-as-master ([1] nodes joined)[{gxdev1}{ntQC1xXORxaS-X7rjU0w-A}{Hye6nHtRT7iZ4qxd9FGUeg}{127.0.0.1}{127.0.0.1:9300}{cdfhilmrstw} completing election, _BECOME_MASTER_TASK_, _FINISH_ELECTION_]]: failed to commit cluster state version [79]", "ecs.version": "1.2.0","service.name":"ES_ECS","event.dataset":"elasticsearch.server","process.thread.name":"elasticsearch[gxdev1][masterService#upda teTask][T#1]","log.logger":"org.elasticsearch.cluster.service.MasterService","elasticsearch.cluster.uuid":"55_PjKTLS5-yDT-K-pkh6w","elasticsearch.node.id":"ntQC1xXORxaS-X7rjU0w-A","elasticsearch.node.name":"gxdev1","elasticsearch.cluster. name":"elasticsearch","error.type":"org.elasticsearch.cluster.coordination.FailedToCommitClusterStateException","error.message":"publication failed","error.stack_trace":"org.elasticsearch.cluster.coordination.FailedToCommitClusterStateExc eption: publication failed\n\tat org.elasticsearch.cluster.coordination.Coordinator$CoordinatorPublication$4.onFailure(Coordinator.java:1718)\n\tat org.elasticsearch.common.util.concurrent.ListenableFuture.notifyListenerDirectly(Listenabl eFuture.java:115)\n\tat org.elasticsearch.common.util.concurrent.ListenableFuture.addListener(ListenableFuture.java:55)\n\tat org.elasticsearch.cluster.coordination.Coordinator$CoordinatorPublication.onCompletion(Coordinator.java:1625)\n\ tat org.elasticsearch.cluster.coordination.Publication.onPossibleCompletion(Publication.java:114)\n\tat org.elasticsearch.cluster.coordination.Publication.onPossibleCommitFailure(Publication.java:165)\n\tat org.elasticsearch.cluster.coord ination.Publication$PublicationTarget$PublishResponseHandler.onFailure(Publication.java:376)\n\tat org.elasticsearch.cluster.coordination.Coordinator$4.onFailure(Coordinator.java:1371)\n\tat org.elasticsearch.cluster.coordination.Publicat ionTransportHandler$PublicationContext$1.onFailure(PublicationTransportHandler.java:360)\n\tat org.elasticsearch.cluster.coordination.PublicationTransportHandler$PublicationContext.lambda$sendClusterStateDiff$7(PublicationTransportHandler .java:438)\n\tat org.elasticsearch.action.ActionListener$DelegatingActionListener.onFailure(ActionListener.java:192)\n\tat org.elasticsearch.action.ActionListener$Delegating.onFailure(ActionListener.java:66)\n\tat org.elasticsearch.action .ActionListener$RunAfterActionListener.onFailure(ActionListener.java:350)\n\tat org.elasticsearch.action.ActionListener$Delegating.onFailure(ActionListener.java:66)\n\tat org.elasticsearch.action.ActionListener$RunAfterActionListener.onFa ilure(ActionListener.java:350)\n\tat org.elasticsearch.action.ActionListenerResponseHandler.handleException(ActionListenerResponseHandler.java:48)\n\tat org.elasticsearch.transport.TransportService$ContextRestoreResponseHandler.handleExce ption(TransportService.java:1349)\n\tat org.elasticsearch.transport.TransportService$DirectResponseChannel.processException(TransportService.java:1458)\n\tat org.elasticsearch.transport.TransportService$DirectResponseChannel$2.run(Transpo rtService.java:1437)\n\tat org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingRunnable.run(ThreadContext.java:717)\n\tat java.base/java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1136)\n\tat java.base/java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:635)\n\tat java.base/java.lang.Thread.run(Thread.java:833)\nCaused by: org.elasticsearch.common.util.concurrent.UncategorizedExecutionException: Failed e xecution\n\tat org.elasticsearch.common.util.concurrent.FutureUtils.rethrowExecutionException(FutureUtils.java:80)\n\tat org.elasticsearch.common.util.concurrent.FutureUtils.get(FutureUtils.java:72)\n\tat org.elasticsearch.common.util.con current.ListenableFuture.notifyListenerDirectly(ListenableFuture.java:112)\n\t... 21 more\nCaused by: java.util.concurrent.ExecutionException: org.apache.lucene.index.CorruptIndexException: checksum failed (hardware problem?) : expected=2 e603023 actual=f0db10c0 (resource=BufferedChecksumIndexInput(NIOFSIndexInput(path=\"/mq_cluster/data/elasticsearch/_state/_9b.fdt\")))\n\tat org.elasticsearch.common.util.concurrent.BaseFuture$Sync.getValue(BaseFuture.java:257)\n\tat org. elasticsearch.common.util.concurrent.BaseFuture$Sync.get(BaseFuture.java:231)\n\tat org.elasticsearch.common.util.concurrent.BaseFuture.get(BaseFuture.java:53)\n\tat org.elasticsearch.common.util.concurrent.FutureUtils.get(FutureUtils.jav a:65)\n\t... 22 more\nCaused by: org.apache.lucene.index.CorruptIndexException: checksum failed (hardware problem?) : expected=2e603023 actual=f0db10c0 (resource=BufferedChecksumIndexInput(NIOFSIndexInput(path=\"/mq_cluster/data/elasticse arch/_state/_9b.fdt\")))\n\tat org.apache.lucene.codecs.CodecUtil.checkFooter(CodecUtil.java:440)\n\tat org.apache.lucene.codecs.lucene90.Lucene90CompoundFormat.writeCompoundFile(Lucene90CompoundFormat.java:123)\n\tat org.apache.lucene.co decs.lucene90.Lucene90CompoundFormat.write(Lucene90CompoundFormat.java:98)\n\tat org.apache.lucene.index.IndexWriter.createCompoundFile(IndexWriter.java:5563)\n\tat org.apache.lucene.index.DocumentsWriterPerThread.sealFlushedSegment(Docum entsWriterPerThread.java:537)\n\tat org.apache.lucene.index.DocumentsWriterPerThread.flush(DocumentsWriterPerThread.java:468)\n\tat org.apache.lucene.index.DocumentsWriter.doFlush(DocumentsWriter.java:497)\n\tat org.apache.lucene.index.Do cumentsWriter.flushAllThreads(DocumentsWriter.java:676)\n\tat org.apache.lucene.index.IndexWriter.doFlush(IndexWriter.java:4014)\n\tat org.apache.lucene.index.IndexWriter.flush(IndexWriter.java:3988)\n\tat org.apache.lucene.index.IndexWri ter.flush(IndexWriter.java:3967)\n\tat org.elasticsearch.gateway.PersistedClusterStateService$MetadataIndexWriter.flush(PersistedClusterStateService.java:692)\n\tat org.elasticsearch.gateway.PersistedClusterStateService$Writer.addMetadata (PersistedClusterStateService.java:991)\n\tat org.elasticsearch.gateway.PersistedClusterStateService$Writer.overwriteMetadata(PersistedClusterStateService.java:975)\n\tat org.elasticsearch.gateway.PersistedClusterStateService$Writer.write FullStateAndCommit(PersistedClusterStateService.java:788)\n\tat org.elasticsearch.gateway.GatewayMetaState$LucenePersistedState.setLastAcceptedState(GatewayMetaState.java:504)\n\tat org.elasticsearch.cluster.coordination.CoordinationState .handlePublishRequest(CoordinationState.java:392)\n\tat org.elasticsearch.cluster.coordination.Coordinator.handlePublishRequest(Coordinator.java:418)\n\tat org.elasticsearch.cluster.coordination.PublicationTransportHandler.acceptState(Pub licationTransportHandler.java:200)\n\tat org.elasticsearch.cluster.coordination.PublicationTransportHandler.handleIncomingPublishRequest(PublicationTransportHandler.java:183)\n\tat org.elasticsearch.cluster.coordination.PublicationTranspo rtHandler.lambda$new$0(PublicationTransportHandler.java:103)\n\tat org.elasticsearch.transport.RequestHandlerRegistry.processMessageReceived(RequestHandlerRegistry.java:67)\n\tat org.elasticsearch.transport.TransportService$6.doRun(Transp ortService.java:917)\n\tat org.elasticsearch.common.util.concurrent.ThreadContext$ContextPreservingAbstractRunnable.doRun(ThreadContext.java:776)\n\tat org.elasticsearch.common.util.concurrent.AbstractRunnable.run(AbstractRunnable.java:26 )\n\t... 3 more\n"} ```

system · May 4, 2022, 1:43am

This topic was automatically closed 28 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
ES upgrade 0.20.6 to 1.3.4 -> CorruptIndexException Elasticsearch	6	556	July 6, 2017
Corrupted Shard on Recovery Elasticsearch	10	739	July 6, 2017
CorruptIndexException when trying to replicate one shard of a new index Elasticsearch	4	1438	July 6, 2017
Damaged ES cluster after upgrade - serious problem - please help Elasticsearch	4	563	July 6, 2017
Corruptindexexception warn message unclear Elasticsearch	5	965	July 6, 2017

Zlib 1.2.12 getting corruption errors

/usr/share/elasticsearch/logs/elasticsearch.log

Related topics