1. 27882:20160620:140553.557 cannot send list of active checks to [43.239.176.21]: host [IP] not found

    修改客户端zabbix_agent.conf的Hostname 即可


2.    25415:20160620:135024.220 Zabbix agent item "flow.eth0" on host "39.109.11.14" failed: first network error, wait for 15 seconds

    这个自定义脚本执行时间至少需要5s,而zabbix  timeout默认为3,修改timeout值即可


3. 28012:20160620:135342.066 item "14.152.90.85:flow.eth0" became not supported: Received value [1.6] is not suitable for value type [Numeric (unsigned)] and data type [Decimal]

    自定义的item值类型有问题,修改为对应的浮点型即可


4.Lack of free swap space on x.x.x.x  

    到客户端创建swap空间即可


5.zabbix每项设置都对,但是触发不了action

    administration-->User-->User group -->permission  查看是否有读写权限,默认是deny


6.报警邮件总是以附件形式发送

    进行格式转换,比如mail命令脚本可以改写成以下形式:

        #!/bin/bash

        echo "$3">/tmp/mail.txt

        dos2unix /tmp/mail.txt

        mail -s "$2" $1 </tmp/mail.txt


7.zabbix出现大量:Zabbix agent on x.x.x.x is unreachable for 5 minutes

    (1)在服务端zabbix_get -s x.x.x.x  -k 'agent.ping' 正常获取值,排除客户端配置和网络问题

    (2)查日志发现大量MySQL server has gone away报错

    (3)参考:https://blog.51cto.com/itnihao/1733492 重新编译zabbix

    (4)报错变成大量的“resuming Zabbix agent checks on host [Zabbix server]: connection restored”

    (5)怀疑zabbix内存不够,修改zabbix_server.conf配置:CacheSize=4096M,加大内存,重启zabbix,问题解决


8.待续


解决问题思路:

    有些问题很诡异,比如我的zabbix经常无故报Zabbix agent on x.x.x.x is unreachable for 5 minutes,但是在zabbix  server端能正常获取到值,item 和trigger 也没问题。百思不得其解,最后根据zabbix日志里面的内容,把发生问题的选项一一修复好,这个问题也不得而解了,可见zabbix很多组件可能有依赖关系,优先解决error_LOG里面的内容