DataNode自定义监控

最新推荐文章于 2025-08-10 19:30:00 发布

原创最新推荐文章于 2025-08-10 19:30:00 发布 · 1.5k 阅读

1 ·

CC 4.0 BY-SA版权

hadoop 同时被 2 个专栏收录

54 篇文章

订阅专栏

hdfs 日常操作

11 篇文章

订阅专栏

本文介绍了Hadoop DataNode的重要性能指标，包括数据读写操作、缓存使用情况、JVM垃圾回收统计及磁盘空间使用等关键信息，为优化Hadoop集群提供了重要参考。

采集路径：http://XXXXX:50075/jmx?qry=Hadoop:service=DataNode,name=*

一、DataNode性能信息（核心指标）

Hadoop:service=DataNode,name=DataNodeActivity-R720ip67-50010

Metric	Type(GAUGE,COUNTER)	类型	业务意义
BytesWritten	COUNTER		Total number of bytes written to DataNode
BytesRead	COUNTER		Total number of bytes read from DataNode
BlocksWritten	COUNTER		Total number of blocks written to DataNode
BlocksRead	COUNTER		Total number of blocks read from DataNode
BlocksReplicated	COUNTER		Total number of blocks replicated
BlocksRemoved	COUNTER		Total number of blocks removed
BlocksVerified	COUNTER		Total number of blocks verified
BlockVerificationFailures	COUNTER		Total number of verifications failures
BlocksCached	GAUGE		Total number of blocks cached
BlocksUncached	GAUGE		Total number of blocks uncached
ReadsFromLocalClient	COUNTER		Total number of read operations from local client
ReadsFromRemoteClient	COUNTER		Total number of read operations from remote client
WritesFromLocalClient	COUNTER		Total number of write operations from local client
WritesFromRemoteClient	COUNTER		Total number of write operations from remote client
BlocksGetLocalPathInfo	COUNTER		Total number of operations to get local path names of blocks
FsyncCount	COUNTER		Total number of fsync
VolumeFailures	COUNTER		Total number of volume failures occurred
ReadBlockOpNumOps	COUNTER		Total number of read operations
ReadBlockOpAvgTime	GAUGE	ms	Average time of read operations in milliseconds
WriteBlockOpNumOps	COUNTER		Total number of write operations
WriteBlockOpAvgTime	GAUGE	ms	Average time of write operations in milliseconds
BlockChecksumOpNumOps	COUNTER		Total number of blockChecksum operations
BlockChecksumOpAvgTime	GAUGE	ms	Average time of blockChecksum operations in milliseconds
CopyBlockOpNumOps	COUNTER		Total number of block copy operations
CopyBlockOpAvgTime	GAUGE	ms	Average time of block copy operations in milliseconds
ReplaceBlockOpNumOps	COUNTER		Total number of block replace operations
ReplaceBlockOpAvgTime	GAUGE	ms	Average time of block replace operations in milliseconds
HeartbeatsNumOps	COUNTER		Total number of heartbeats
HeartbeatsAvgTime	GAUGE	ms	Average heartbeat time in milliseconds
BlockReportsNumOps	COUNTER		Total number of block report operations
BlockReportsAvgTime	GAUGE	ms	Average time of block report operations in milliseconds
CacheReportsNumOps	COUNTER		Total number of cache report operations
CacheReportsAvgTime	GAUGE	ms	Average time of cache report operations in milliseconds
PacketAckRoundTripTimeNanosNumOps	COUNTER		Total number of ack round trip
PacketAckRoundTripTimeNanosAvgTime	GAUGE	ms	Average time from ack send to receive minus the downstream ack time in nanoseconds
FlushNanosNumOps	COUNTER		Total number of flushes
FlushNanosAvgTime	GAUGE	ms	Average flush time in nanoseconds
FsyncNanosNumOps	COUNTER		Total number of fsync
FsyncNanosAvgTime	GAUGE	ms	Average fsync time in nanoseconds
SendDataPacketBlockedOnNetworkNanosNumOps	COUNTER		Total number of sending packets
SendDataPacketBlockedOnNetworkNanosAvgTime	GAUGE	ms	Average waiting time of sending packets in nanoseconds
SendDataPacketTransferNanosNumOps	COUNTER		Total number of sending packets
SendDataPacketTransferNanosAvgTime	GAUGE	ms	Average transfer time of sending packets in nanoseconds

二、DataNode JvmMetrics详细信息（核心指标）

Hadoop:service=DataNode,name=JvmMetrics

Metric	Type(GAUGE,COUNTER)	类型	业务意义
GcCountParNew	COUNTER		新生代GC次数
GcTimeMillisParNew	COUNTER	ms	新生代GC耗时（ms）
GcCountConcurrentMarkSweep	COUNTER		老年代GC次数
GcTimeMillisConcurrentMarkSweep	COUNTER	ms	老年代GC耗时（ms）
GcCount	COUNTER		总的GC次数
GcTimeMillis	COUNTER	ms	总的GC耗时（ms）

三、DataNode空间信息（核心指标）

Hadoop:service=DataNode,name=FSDatasetState-null

Metric	Type(GAUGE,COUNTER)	类型	业务意义
Capacity	GAUGE	GAUGE	DN容量
DfsUsed	GAUGE	GAUGE	DN已经使用的容量
NumFailedVolumes	GAUGE	GAUGE	DN坏卷的个数