一、监控架构设计
核心组件与数据流
- Prometheus:时序数据采集、存储与告警规则管理
- Node Exporter:采集主机指标(CPU、内存、磁盘、网络等)
- 数据库Exporter:如
mysqld_exporter、postgres_exporter
- Grafana:数据可视化与仪表盘展示
- Alertmanager(可选):告警通知管理
二、主机环境准备
1. 系统要求
- Linux系统(推荐CentOS 7+/Ubuntu 20.04+)
- 开放端口:9090(Prometheus)、3000(Grafana)、9100(Node Exporter)
- 确保所有节点时间同步(NTP服务)
sudo yum install ntp
sudo systemctl start ntpd
sudo systemctl enable ntpd
sudo apt install ntp
sudo systemctl restart ntp
三、组件安装与配置
1. 安装Prometheus Server
下载二进制包
wget https://github.com/prometheus/prometheus/releases/download/v2.39.1/prometheus-2.39.1.linux-amd64.tar.gz
tar xvfz prometheus-*.tar.gz
sudo mv prometheus-2.39.1.linux-amd64 /usr/local/prometheus
创建系统服务
sudo useradd --no-create-home --shell /bin/false prometheus
sudo mkdir /etc/prometheus /var/lib/prometheus
sudo chown prometheus:prometheus /var/lib/prometheus
sudo cat <<EOF > /etc/systemd/system/prometheus.service
[Unit]
Description=Prometheus
Wants=network-online.target
After=network-online.target
[Service]
User=prometheus
Group=prometheus
ExecStart=/usr/local/prometheus/prometheus \
--config.file=/etc/prometheus/prometheus.yml \
--storage.tsdb.path=/var/lib/prometheus \
--web.listen-address=0.0.0.0:9090
Restart=always
[Install]
WantedBy=multi-user.target
EOF
sudo cp