新购入的建兴ZETA 256G,在CentOS 7.2中,用PostgreSQL自带的fsync测试工具pg_test_fsync测试IOPS时,突然IO hang住了。
dmesg报了一堆这样的超时:
现象和网上描述的类似,很多SSD有这样的问题。
res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
[ 895.604149] ata1.00: status: { DRDY }
[ 895.606940] ata1.00: failed command: WRITE FPDMA QUEUED
[ 895.609389] ata1.00: cmd 61/08:e0:38:bd:06/00:00:00:00:00/40 tag 28 ncq 4096 out
res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
[ 895.614144] ata1.00: status: { DRDY }
[ 895.616516] ata1.00: failed command: WRITE FPDMA QUEUED
[ 895.618665] ata1.00: cmd 61/10:e8:00:90:06/02:00:00:00:00/40 tag 29 ncq 270336 out
res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
[ 895.622940] ata1.00: status: { DRDY }
[ 895.625089] ata1.00: failed command: WRITE FPDMA QUEUED
[ 895.627236] ata1.00: cmd 61/00:f0:00:8c:06/04:00:00:00:00/40 tag 30 ncq 524288 out
res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x4 (timeout)
[ 895.631176] ata1.00: status: { DRDY }
[ 895.633133] ata1: hard resetting link
[ 895.937682] ata1: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[ 895.940816] ata1.00: ACPI cmd ef/10:03:00:00:00:a0 (SET FEATURES) filtered out
[ 895.940830] ata1.00: ACPI cmd f5/00:00:00:00:00:a0 (SECURITY FREEZE LOCK) filtered out
[ 895.941234] ata1.00: ACPI cmd ef/10:03:00:00:00:a0 (SET FEATURES) filtered out
[ 895.941243] ata1.00: ACPI cmd f5/00:00:00:00:00:a0 (SECURITY FREEZE LOCK) filtered out
[ 895.941314] ata1.00: configured for UDMA/133
[ 895.941356] ata1.00: device reported invalid CHS sector 0
[ 895.941362] ata1.00: device reported invalid CHS sector 0
[ 895.941366] ata1.00: device reported invalid CHS sector 0
[ 895.941369] ata1.00: device reported invalid CHS sector 0
[ 895.941374] ata1.00: device reported invalid CHS sector 0
[ 895.941377] ata1.00: device reported invalid CHS sector 0
[ 895.941381] ata1.00: device reported invalid CHS sector 0
[ 895.941384] ata1.00: device reported invalid CHS sector 0
[ 895.941388] ata1.00: device reported invalid CHS sector 0
[ 895.941392] ata1.00: device reported invalid CHS sector 0
[ 895.941395] ata1.00: device reported invalid CHS sector 0
[ 895.941399] ata1.00: device reported invalid CHS sector 0
[ 895.941403] ata1.00: device reported invalid CHS sector 0
[ 895.941408] ata1.00: device reported invalid CHS sector 0
[ 895.941434] ata1: EH complete
现象和网上描述的类似,很多SSD有这样的问题。