HMASTER日志报错2025-11-13 17:30:58,132 INFO [sxs1:16000.activeMasterManager] procedure.ZKProcedureUtil: Clearing all procedure znodes: /hbase/flush-table-proc/acquired /hbase/flush-table-proc/reached /hbase/flush-table-proc/abort
2025-11-13 17:30:58,139 INFO [sxs1:16000.activeMasterManager] procedure.ZKProcedureUtil: Clearing all procedure znodes: /hbase/online-snapshot/acquired /hbase/online-snapshot/reached /hbase/online-snapshot/abort
2025-11-13 17:30:58,152 INFO [sxs1:16000.activeMasterManager] master.MasterCoprocessorHost: System coprocessor loading is enabled
2025-11-13 17:30:58,160 INFO [sxs1:16000.activeMasterManager] procedure2.ProcedureExecutor: Starting procedure executor threads=5
2025-11-13 17:30:58,160 INFO [sxs1:16000.activeMasterManager] wal.WALProcedureStore: Starting WAL Procedure Store lease recovery
2025-11-13 17:30:58,164 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recover lease on dfs file hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000006.log
2025-11-13 17:30:58,168 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recovered lease, attempt=0 on file=hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000006.log after 3ms
2025-11-13 17:30:58,172 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recover lease on dfs file hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000007.log
2025-11-13 17:30:58,173 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recovered lease, attempt=0 on file=hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000007.log after 1ms
2025-11-13 17:30:58,177 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recover lease on dfs file hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000008.log
2025-11-13 17:30:58,178 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recovered lease, attempt=0 on file=hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000008.log after 1ms
2025-11-13 17:30:58,184 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recover lease on dfs file hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000009.log
2025-11-13 17:30:58,188 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recovered lease, attempt=0 on file=hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000009.log after 4ms
2025-11-13 17:30:58,193 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recover lease on dfs file hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000010.log
2025-11-13 17:30:58,195 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recovered lease, attempt=0 on file=hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000010.log after 2ms
2025-11-13 17:30:58,201 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recover lease on dfs file hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000011.log
2025-11-13 17:30:58,201 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recovered lease, attempt=0 on file=hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000011.log after 0ms
2025-11-13 17:30:58,205 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recover lease on dfs file hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000012.log
2025-11-13 17:30:58,205 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recovered lease, attempt=0 on file=hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000012.log after 0ms
2025-11-13 17:30:58,211 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recover lease on dfs file hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000013.log
2025-11-13 17:30:58,212 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recovered lease, attempt=0 on file=hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000013.log after 1ms
2025-11-13 17:30:58,215 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recover lease on dfs file hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000014.log
2025-11-13 17:30:58,216 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recovered lease, attempt=0 on file=hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000014.log after 1ms
2025-11-13 17:30:58,224 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recover lease on dfs file hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000015.log
2025-11-13 17:30:58,225 INFO [sxs1:16000.activeMasterManager] util.FSHDFSUtils: Recovered lease, attempt=0 on file=hdfs://sxs1:9000/hbase/MasterProcWALs/state-00000000000000000015.log after 1ms
2025-11-13 17:30:58,248 INFO [sxs1:16000.activeMasterManager] wal.WALProcedureStore: Lease acquired for flushLogId: 16
2025-11-13 17:30:58,256 INFO [sxs1:16000.activeMasterManager] zookeeper.RecoverableZooKeeper: Process identifier=replicationLogCleaner connecting to ZooKeeper ensemble=sxs1:2181,sxs2:2181,sxs3:2181
2025-11-13 17:30:58,268 INFO [sxs1:16000.activeMasterManager] master.ServerManager: Waiting for region servers count to settle; currently checked in 0, slept for 0 ms, expecting minimum of 1, maximum of 2147483647, timeout of 4500 ms, interval of 1500 ms.
2025-11-13 17:30:59,783 INFO [sxs1:16000.activeMasterManager] master.ServerManager: Waiting for region servers count to settle; currently checked in 0, slept for 1515 ms, expecting minimum of 1, maximum of 2147483647, timeout of 4500 ms, interval of 1500 ms.
2025-11-13 17:31:00,918 INFO [B.defaultRpcServer.handler=1,queue=1,port=16000] master.ServerManager: Registering server=sxs1,16020,1763025688227
2025-11-13 17:31:00,928 INFO [B.defaultRpcServer.handler=0,queue=0,port=16000] master.ServerManager: Registering server=sxs2,16020,1763024791986
2025-11-13 17:31:00,949 INFO [sxs1:16000.activeMasterManager] master.ServerManager: Waiting for region servers count to settle; currently checked in 2, slept for 2681 ms, expecting minimum of 1, maximum of 2147483647, timeout of 4500 ms, interval of 1500 ms.
2025-11-13 17:31:00,966 INFO [B.defaultRpcServer.handler=2,queue=2,port=16000] master.ServerManager: Registering server=sxs3,16020,1763024792019
2025-11-13 17:31:01,004 INFO [sxs1:16000.activeMasterManager] master.ServerManager: Waiting for region servers count to settle; currently checked in 3, slept for 2736 ms, expecting minimum of 1, maximum of 2147483647, timeout of 4500 ms, interval of 1500 ms.
2025-11-13 17:31:02,522 INFO [sxs1:16000.activeMasterManager] master.ServerManager: Waiting for region servers count to settle; currently checked in 3, slept for 4254 ms, expecting minimum of 1, maximum of 2147483647, timeout of 4500 ms, interval of 1500 ms.
2025-11-13 17:31:02,775 INFO [sxs1:16000.activeMasterManager] master.ServerManager: Finished waiting for region servers count to settle; checked in 3, slept for 4507 ms, expecting minimum of 1, maximum of 2147483647, master is running
2025-11-13 17:31:02,782 INFO [sxs1:16000.activeMasterManager] master.MasterFileSystem: Log folder hdfs://sxs1:9000/hbase/WALs/sxs1,16020,1761034573023 doesn't belong to a known region server, splitting
2025-11-13 17:31:02,790 INFO [sxs1:16000.activeMasterManager] master.MasterFileSystem: Log folder hdfs://sxs1:9000/hbase/WALs/sxs1,16020,1762933906517 doesn't belong to a known region server, splitting
2025-11-13 17:31:02,793 INFO [sxs1:16000.activeMasterManager] master.MasterFileSystem: Log folder hdfs://sxs1:9000/hbase/WALs/sxs1,16020,1762938640637 doesn't belong to a known region server, splitting
2025-11-13 17:31:02,793 INFO [sxs1:16000.activeMasterManager] master.MasterFileSystem: Log folder hdfs://sxs1:9000/hbase/WALs/sxs1,16020,1762939757280 doesn't belong to a known region server, splitting
2025-11-13 17:31:02,795 INFO [sxs1:16000.activeMasterManager] master.MasterFileSystem: Log folder hdfs://sxs1:9000/hbase/WALs/sxs1,16020,1763025688227 belongs to an existing region server
2025-11-13 17:31:02,796 INFO [sxs1:16000.activeMasterManager] master.MasterFileSystem: Log folder hdfs://sxs1:9000/hbase/WALs/sxs2,16020,1761034787828-splitting doesn't belong to a known region server, splitting
2025-11-13 17:31:02,797 INFO [sxs1:16000.activeMasterManager] master.MasterFileSystem: Log folder hdfs://sxs1:9000/hbase/WALs/sxs2,16020,1762933902565 doesn't belong to a known region server, splitting
2025-11-13 17:31:02,797 INFO [sxs1:16000.activeMasterManager] master.MasterFileSystem: Log folder hdfs://sxs1:9000/hbase/WALs/sxs2,16020,1762937541819 doesn't belong to a known region server, splitting
2025-11-13 17:31:02,799 INFO [sxs1:16000.activeMasterManager] master.MasterFileSystem: Log folder hdfs://sxs1:9000/hbase/WALs/sxs2,16020,1762938639822 doesn't belong to a known region server, splitting
2025-11-13 17:31:02,800 INFO [sxs1:16000.activeMasterManager] master.MasterFileSystem: Log folder hdfs://sxs1:9000/hbase/WALs/sxs2,16020,1763024791986 belongs to an existing region server
2025-11-13 17:31:02,803 INFO [sxs1:16000.activeMasterManager] master.MasterFileSystem: Log folder hdfs://sxs1:9000/hbase/WALs/sxs3,16020,1761034787825 doesn't belong to a known region server, splitting
2025-11-13 17:31:02,805 INFO [sxs1:16000.activeMasterManager] master.MasterFileSystem: Log folder hdfs://sxs1:9000/hbase/WALs/sxs3,16020,1762933904233 doesn't belong to a known region server, splitting
2025-11-13 17:31:02,806 INFO [sxs1:16000.activeMasterManager] master.MasterFileSystem: Log folder hdfs://sxs1:9000/hbase/WALs/sxs3,16020,1762937541854 doesn't belong to a known region server, splitting
2025-11-13 17:31:02,807 INFO [sxs1:16000.activeMasterManager] master.MasterFileSystem: Log folder hdfs://sxs1:9000/hbase/WALs/sxs3,16020,1762938639774 doesn't belong to a known region server, splitting
2025-11-13 17:31:02,808 INFO [sxs1:16000.activeMasterManager] master.MasterFileSystem: Log folder hdfs://sxs1:9000/hbase/WALs/sxs3,16020,1763024792019 belongs to an existing region server
2025-11-13 17:31:02,814 INFO [sxs1:16000.activeMasterManager] master.SplitLogManager: dead splitlog workers [sxs2,16020,1761034787828]
2025-11-13 17:31:02,816 INFO [sxs1:16000.activeMasterManager] master.SplitLogManager: Started splitting 1 logs in [hdfs://sxs1:9000/hbase/WALs/sxs2,16020,1761034787828-splitting] for [sxs2,16020,1761034787828]
2025-11-13 17:31:02,827 INFO [main-EventThread] coordination.SplitLogManagerCoordination: task /hbase/splitWAL/WALs%2Fsxs2%2C16020%2C1761034787828-splitting%2Fsxs2%252C16020%252C1761034787828..meta.1761038482217.meta acquired by sxs3,16020,1763024792019
2025-11-13 17:31:03,059 INFO [sxs1,16000,1763026256316_splitLogManager__ChoreService_1] master.SplitLogManager: total tasks = 1 unassigned = 0 tasks={/hbase/splitWAL/WALs%2Fsxs2%2C16020%2C1761034787828-splitting%2Fsxs2%252C16020%252C1761034787828..meta.1761038482217.meta=last_update = 1763026262876 last_version = 2 cur_worker_name = sxs3,16020,1763024792019 status = in_progress incarnation = 0 resubmits = 0 batch = installed = 1 done = 0 error = 0}
2025-11-13 17:31:09,058 INFO [sxs1,16000,1763026256316_splitLogManager__ChoreService_1] master.SplitLogManager: total tasks = 1 unassigned = 0 tasks={/hbase/splitWAL/WALs%2Fsxs2%2C16020%2C1761034787828-splitting%2Fsxs2%252C16020%252C1761034787828..meta.1761038482217.meta=last_update = 1763026262876 last_version = 2 cur_worker_name = sxs3,16020,1763024792019 status = in_progress incarnation = 0 resubmits = 0 batch = installed = 1 done = 0 error = 0}
2025-11-13 17:31:15,058 INFO [sxs1,16000,1763026256316_splitLogManager__ChoreService_1] master.SplitLogManager: total tasks = 1 unassigned = 0 tasks={/hbase/splitWAL/WALs%2Fsxs2%2C16020%2C1761034787828-splitting%2Fsxs2%252C16020%252C1761034787828..meta.1761038482217.meta=last_update = 1763026262876 last_version = 2 cur_worker_name = sxs3,16020,1763024792019 status = in_progress incarnation = 0 resubmits = 0 batch = installed = 1 done = 0 error = 0}
2025-11-13 17:31:18,634 INFO [main-EventThread] coordination.SplitLogManagerCoordination: task /hbase/splitWAL/WALs%2Fsxs2%2C16020%2C1761034787828-splitting%2Fsxs2%252C16020%252C1761034787828..meta.1761038482217.meta entered state: ERR sxs3,16020,1763024792019
2025-11-13 17:31:18,634 WARN [main-EventThread] coordination.SplitLogManagerCoordination: Error splitting /hbase/splitWAL/WALs%2Fsxs2%2C16020%2C1761034787828-splitting%2Fsxs2%252C16020%252C1761034787828..meta.1761038482217.meta
2025-11-13 17:31:18,635 WARN [sxs1:16000.activeMasterManager] master.SplitLogManager: error while splitting logs in [hdfs://sxs1:9000/hbase/WALs/sxs2,16020,1761034787828-splitting] installed = 1 but only 0 done
2025-11-13 17:31:18,635 FATAL [sxs1:16000.activeMasterManager] master.HMaster: Failed to become active master
java.io.IOException: error or interrupted while splitting logs in [hdfs://sxs1:9000/hbase/WALs/sxs2,16020,1761034787828-splitting] Task = installed = 1 done = 0 error = 1
at org.apache.hadoop.hbase.master.SplitLogManager.splitLogDistributed(SplitLogManager.java:290)
at org.apache.hadoop.hbase.master.MasterFileSystem.splitLog(MasterFileSystem.java:403)
at org.apache.hadoop.hbase.master.MasterFileSystem.splitMetaLog(MasterFileSystem.java:313)
at org.apache.hadoop.hbase.master.MasterFileSystem.splitMetaLog(MasterFileSystem.java:304)
at org.apache.hadoop.hbase.master.HMaster.splitMetaLogBeforeAssignment(HMaster.java:1046)
at org.apache.hadoop.hbase.master.HMaster.finishActiveMasterInitialization(HMaster.java:750)
at org.apache.hadoop.hbase.master.HMaster.access$600(HMaster.java:189)
at org.apache.hadoop.hbase.master.HMaster$2.run(HMaster.java:1803)
at java.lang.Thread.run(Thread.java:745)
2025-11-13 17:31:18,636 FATAL [sxs1:16000.activeMasterManager] master.HMaster: Master server abort: loaded coprocessors are: []
2025-11-13 17:31:18,636 FATAL [sxs1:16000.activeMasterManager] master.HMaster: Unhandled exception. Starting shutdown.
java.io.IOException: error or interrupted while splitting logs in [hdfs://sxs1:9000/hbase/WALs/sxs2,16020,1761034787828-splitting] Task = installed = 1 done = 0 error = 1
at org.apache.hadoop.hbase.master.SplitLogManager.splitLogDistributed(SplitLogManager.java:290)
at org.apache.hadoop.hbase.master.MasterFileSystem.splitLog(MasterFileSystem.java:403)
at org.apache.hadoop.hbase.master.MasterFileSystem.splitMetaLog(MasterFileSystem.java:313)
at org.apache.hadoop.hbase.master.MasterFileSystem.splitMetaLog(MasterFileSystem.java:304)
at org.apache.hadoop.hbase.master.HMaster.splitMetaLogBeforeAssignment(HMaster.java:1046)
at org.apache.hadoop.hbase.master.HMaster.finishActiveMasterInitialization(HMaster.java:750)
at org.apache.hadoop.hbase.master.HMaster.access$600(HMaster.java:189)
at org.apache.hadoop.hbase.master.HMaster$2.run(HMaster.java:1803)
at java.lang.Thread.run(Thread.java:745)
2025-11-13 17:31:18,636 INFO [sxs1:16000.activeMasterManager] regionserver.HRegionServer: STOPPED: Unhandled exception. Starting shutdown.
2025-11-13 17:31:18,636 INFO [master/sxs1/192.168.78.100:16000] regionserver.HRegionServer: Stopping infoServer
2025-11-13 17:31:18,669 INFO [master/sxs1/192.168.78.100:16000] procedure2.ProcedureExecutor: Stopping the procedure executor
2025-11-13 17:31:18,669 INFO [master/sxs1/192.168.78.100:16000] wal.WALProcedureStore: Stopping the WAL Procedure Store
2025-11-13 17:31:18,776 INFO [master/sxs1/192.168.78.100:16000] regionserver.HRegionServer: stopping server sxs1,16000,1763026256316
2025-11-13 17:31:18,776 INFO [master/sxs1/192.168.78.100:16000] client.ConnectionManager$HConnectionImplementation: Closing zookeeper sessionid=0x39a7c74c6a6000e
2025-11-13 17:31:18,778 INFO [master/sxs1/192.168.78.100:16000] regionserver.HRegionServer: stopping server sxs1,16000,1763026256316; all regions closed.
2025-11-13 17:31:18,779 INFO [master/sxs1/192.168.78.100:16000] hbase.ChoreService: Chore service for: sxs1,16000,1763026256316 had [[ScheduledChore: Name: HFileCleaner Period: 60000 Unit: MILLISECONDS], [ScheduledChore: Name: LogsCleaner Period: 60000 Unit: MILLISECONDS]] on shutdown
2025-11-13 17:31:18,782 INFO [master/sxs1/192.168.78.100:16000] client.ConnectionManager$HConnectionImplementation: Closing zookeeper sessionid=0x29a7c74c6a30007
2025-11-13 17:31:18,785 INFO [master/sxs1/192.168.78.100:16000] hbase.ChoreService: Chore service for: sxs1,16000,1763026256316_splitLogManager_ had [[ScheduledChore: Name: SplitLogManager Timeout Monitor Period: 1000 Unit: MILLISECONDS]] on shutdown
2025-11-13 17:31:18,785 INFO [master/sxs1/192.168.78.100:16000] flush.MasterFlushTableProcedureManager: stop: server shutting down.
2025-11-13 17:31:18,785 INFO [master/sxs1/192.168.78.100:16000] ipc.RpcServer: Stopping server on 16000
2025-11-13 17:31:18,785 INFO [RpcServer.listener,port=16000] ipc.RpcServer: RpcServer.listener,port=16000: stopping
2025-11-13 17:31:18,786 INFO [RpcServer.responder] ipc.RpcServer: RpcServer.responder: stopped
2025-11-13 17:31:18,786 INFO [RpcServer.responder] ipc.RpcServer: RpcServer.responder: stopping
2025-11-13 17:31:18,794 INFO [master/sxs1/192.168.78.100:16000] regionserver.HRegionServer: stopping server sxs1,16000,1763026256316; zookeeper connection closed.
2025-11-13 17:31:18,794 INFO [master/sxs1/192.168.78.100:16000] regionserver.HRegionServer: master/sxs1/192.168.78.100:16000 exiting
最新发布