1. 27882:20160620:140553.557 cannot send list of active checks to [43.239.176.21]: host [IP] not found
修改客户端zabbix_agent.conf的Hostname 即可
2. 25415:20160620:135024.220 Zabbix agent item "flow.eth0" on host "39.109.11.14" failed: first network error, wait for 15 seconds
这个自定义脚本执行时间至少需要5s,而zabbix timeout默认为3,修改timeout值即可
3. 28012:20160620:135342.066 item "14.152.90.85:flow.eth0" became not supported: Received value [1.6] is not suitable for value type [Numeric (unsigned)] and data type [Decimal]
自定义的item值类型有问题,修改为对应的浮点型即可
4.Lack of free swap space on x.x.x.x
到客户端创建swap空间即可
5.zabbix每项设置都对,但是触发不了action
administration-->User-->User group -->permission 查看是否有读写权限,默认是deny
6.报警邮件总是以附件形式发送
进行格式转换,比如mail命令脚本可以改写成以下形式:
#!/bin/bash
echo "$3">/tmp/mail.txt
dos2unix /tmp/mail.txt
mail -s "$2" $1 </tmp/mail.txt
7.zabbix出现大量:Zabbix agent on x.x.x.x is unreachable for 5 minutes
(1)在服务端zabbix_get -s x.x.x.x -k 'agent.ping' 正常获取值,排除客户端配置和网络问题
(2)查日志发现大量MySQL server has gone away报错
(3)参考:https://blog.51cto.com/itnihao/1733492 重新编译zabbix
(4)报错变成大量的“resuming Zabbix agent checks on host [Zabbix server]: connection restored”
(5)怀疑zabbix内存不够,修改zabbix_server.conf配置:CacheSize=4096M,加大内存,重启zabbix,问题解决
8.待续
解决问题思路:
有些问题很诡异,比如我的zabbix经常无故报Zabbix agent on x.x.x.x is unreachable for 5 minutes,但是在zabbix server端能正常获取到值,item 和trigger 也没问题。百思不得其解,最后根据zabbix日志里面的内容,把发生问题的选项一一修复好,这个问题也不得而解了,可见zabbix很多组件可能有依赖关系,优先解决error_LOG里面的内容
转载于:https://blog.51cto.com/linzb/1790985