Run all of the commands below on the server as the kingbase user.
1. Check the cluster status:
repmgr cluster show
ID | Name | Role | Status | Upstream | Location | Priority | Timeline | LSN_Lag | Connection string
----+-------+---------+-----------+----------+----------+----------+----------+---------+-------------------------------------------------------------------------------------------------------------------------------------------------
1 | node1 | primary | * running | | default | 100 | 3 | | host=158.10.1.31 user=esrep dbname=esrep port=54321 connect_timeout=10 keepalives=1 keepalives_idle=10 keepalives_interval=1 keepalives_count=3
2 | node2 | standby | running | node1 | default | 100 | 3 | 0 bytes | host=158.10.1.32 user=esrep dbname=esrep port=54321 connect_timeout=10 keepalives=1 keepalives_idle=10 keepalives_interval=1 keepalives_count=3
The output above shows a healthy cluster: node1 is the primary node and node2 is the standby node.
ID | Name | Role | Status | Upstream | Location | Priority | Timeline | Connection string
----+---------+---------+-----------+----------+----------+----------+----------+---------------------------------------------------------------------------------------------------------------------------------------------------
1 | node1 | primary | - failed | | default | 100 | ? | host=158.10.1.31 user=system dbname=esrep port=54321 connect_timeout=10 keepalives=1 keepalives_idle=10 keepalives_interval=1 keepalives_count=3
2 | node2 | primary | * running | | default | 100 | 4 | host=158.10.1.32 user=system dbname=esrep port=54321 connect_timeout=10 keepalives=1 keepalives_idle=10 keepalives_interval=1 keepalives_count=3
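The output above shows a failure: node1 has failed and node2 is now the primary node.
On the failed node, manually run the following command to rejoin it to the primary node (the IP address after -h is the IP of the current primary node):
repmgr node rejoin -h 158.10.1.32 -Uesrep -d esrep --force-rewind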
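The check described above (spot a failed node, find the running primary, build the rejoin command) can be sketched as a small shell helper. This is only an illustration: the function name suggest_rejoin is hypothetical, the esrep user and database are copied from the command in this document, and the parsing assumes the column layout shown in the sample output (Role in column 3, Status in column 4, connection string last). It only prints the command and does not run it.

```shell
#!/bin/sh
# Sketch: read `repmgr cluster show` output on stdin and print the rejoin
# command to run on the failed node. Nothing is executed here.
# Assumptions: column layout as in the sample output above; the esrep
# user/database are taken from this document; the function name is made up.
suggest_rejoin() {
    awk -F'|' '
        # Data rows start with a numeric node ID; header and ruler lines do not.
        $1 ~ /^[ \t]*[0-9]+[ \t]*$/ {
            role = $3; status = $4; conn = $NF
            gsub(/[ \t]/, "", role)
            if (status ~ /failed/) failed = 1
            # Remember the host of the primary that is still running.
            if (role == "primary" && status ~ /running/ && match(conn, /host=[^ ]+/))
                primary = substr(conn, RSTART + 5, RLENGTH - 5)
        }
        END {
            # Print a suggestion only when a node failed and a primary is up.
            if (failed && primary != "")
                print "repmgr node rejoin -h " primary " -Uesrep -d esrep --force-rewind"
        }
    '
}
```

Typical use would be `repmgr cluster show | suggest_rejoin`. Review the printed command before running it on the failed node; on a healthy cluster the helper prints nothing.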
2. Shutdown procedure:
Run on the primary node:
sys_monitor.sh stop
Then shut down the servers.
3. Cluster startup procedure:
After both servers have finished booting, run on either server:
sys_monitor.sh start
Then check the cluster status:
repmgr cluster show
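After starting the cluster, the status check can be repeated until both nodes report as running. The sketch below is one possible way to do that, not a vendor-supplied tool: the function names cluster_is_healthy and wait_for_cluster, the retry count, and the 2-second interval are all assumptions, and "healthy" is taken to mean what the sample output above shows (exactly one running primary, every other node running).

```shell
#!/bin/sh
# Sketch: decide from `repmgr cluster show` output (stdin) whether the
# cluster looks like the healthy example above.
# Assumption: Role is column 3 and Status is column 4, as in the sample output.
cluster_is_healthy() {
    awk -F'|' '
        # Only rows that start with a numeric node ID are node entries.
        $1 ~ /^[ \t]*[0-9]+[ \t]*$/ {
            role = $3; status = $4
            gsub(/[ \t]/, "", role)
            gsub(/^[ \t]+|[ \t]+$/, "", status)
            if (status != "running" && status != "* running") bad++
            else if (role == "primary") primaries++
        }
        # Exit 0 only with no bad nodes and exactly one running primary.
        END { exit !(bad + 0 == 0 && primaries + 0 == 1) }
    '
}

# Poll the cluster a limited number of times (default 30, hypothetical value).
wait_for_cluster() {
    tries=${1:-30}
    while [ "$tries" -gt 0 ]; do
        if repmgr cluster show 2>/dev/null | cluster_is_healthy; then
            return 0
        fi
        tries=$((tries - 1))
        sleep 2
    done
    return 1
}
```

A possible use after sys_monitor.sh start is `wait_for_cluster 30 && echo "cluster is up"`. If the function returns non-zero, inspect the output of repmgr cluster show by hand.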
4. Database login command:
ksql -Usystem -h 158.10.1.33 -d test
Here system is the user name, test is the database name, and 158.10.1.33 is the virtual IP address.
The log file is located at /home/kingbase/cluster/kingbase/log/hamgr.log
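When a node shows as failed, a quick look at the most recent entries of this log is usually the first step. The helper below is a minimal sketch: the function name recent_log_problems and the HAMGR_LOG variable are hypothetical, and the keyword filter (error, fail, warning) is a guess about what problem lines look like, since the log format is not described in this document.

```shell
#!/bin/sh
# Log path taken from this document; override HAMGR_LOG if yours differs.
HAMGR_LOG=${HAMGR_LOG:-/home/kingbase/cluster/kingbase/log/hamgr.log}

# Sketch: print lines from the last 200 log lines that look like problems.
# The keyword list is an assumption, not a documented log format.
recent_log_problems() {
    log=${1:-$HAMGR_LOG}
    if [ ! -r "$log" ]; then
        echo "cannot read $log" >&2
        return 1
    fi
    # grep exits non-zero when nothing matches; that is not an error here.
    tail -n 200 "$log" | grep -iE 'error|fail|warning' || true
}
```

Running `recent_log_problems` with no argument reads the default path; pass another file as the first argument to inspect a copy of the log.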