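On server1, server2 and server3, disable MySQL autostart at boot, then re-initialize MySQL and change the root password. The steps are the same on all three hosts, except that server_id in /etc/my.cnf must be different on each one.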
[root@server3 ~]# chkconfig --list mysqld
mysqld 0:off 1:off 2:off 3:on 4:on 5:on 6:off
[root@server3 ~]# chkconfig mysqld off
[root@server3 ~]# /etc/init.d/mysqld stop
Stopping mysqld: [ OK ]
[root@server3 ~]# cd /var/lib/mysql
[root@server3 mysql]# rm -rf *
[root@server3 mysql]# vim /etc/my.cnf
server_id=1
gtid_mode=ON
enforce_gtid_consistency=ON
master_info_repository=TABLE
relay_log_info_repository=TABLE
log_slave_updates=ON
log_bin=binlog
binlog_format=ROW
[root@server3 mysql]# > /var/log/mysqld.log
[root@server3 mysql]# /etc/init.d/mysqld start
Initializing MySQL database: [ OK ]
Installing validate password plugin: [ OK ]
Starting mysqld: [ OK ]
[root@server3 mysql]# grep password /var/log/mysqld.log
2018-08-11T01:39:00.153736Z 1 [Note] A temporary password is generated for root@localhost: tC4/OznHHp9g
[root@server3 mysql]# mysql_secure_installation
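mysql_secure_installation asks for the temporary password found by the grep above. A minimal sketch for pulling it out of the log instead of copying it by hand; the function name `extract_temp_password` is made up for this example, and it assumes the MySQL 5.7 log format shown above, where the password is the last field on the line.

```shell
# Hypothetical helper (not part of MySQL): print the newest temporary root
# password found in a MySQL 5.7 error log.
extract_temp_password() {
    # $1: path to the error log, e.g. /var/log/mysqld.log
    grep 'temporary password' "$1" | tail -n 1 | awk '{print $NF}'
}
```

Usage would be something like `extract_temp_password /var/log/mysqld.log`.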
Next, set up master-slave replication.
Server1
mysql> grant replication slave on *.* to repl@'172.25.53.%' identified by 'YaoJing...123@';
mysql> reset master;
Server2,3
mysql> change master to master_host='172.25.53.1', master_user='repl',master_password='YaoJing...123@', master_auto_position=1;
Query OK, 0 rows affected, 2 warnings (0.32 sec)
mysql> start slave;
Query OK, 0 rows affected (0.36 sec)
mysql> show slave status\G
On Server1 (master), create a database named westos.
Server2 and Server3 (slaves) replicate it.
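To confirm that a slave is really replicating, the two thread states in the `show slave status\G` output are what matter. A small sketch that checks them from the shell; `slave_threads_ok` is a hypothetical name, and the function only parses text on stdin, so how the output is produced (for example with `mysql -e`) is up to you.

```shell
# Hypothetical helper: read the output of `show slave status\G` on stdin and
# succeed only if both replication threads report "Yes".
slave_threads_ok() {
    awk -F': ' '
        $1 ~ /Slave_IO_Running$/  { io = $2 }
        $1 ~ /Slave_SQL_Running$/ { sql = $2 }
        END { exit !(io == "Yes" && sql == "Yes") }
    '
}
```

For example: `mysql -uroot -p -e 'show slave status\G' | slave_threads_ok && echo "replication OK"`.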
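MySQL High Availability (MHA)
Introduction:
MHA (Master High Availability) is currently a relatively mature solution for MySQL high availability. It was developed by Yoshinori Matsunobu (yoshinorim) at DeNA in Japan (he later joined Facebook) and is a solid tool for failover and master promotion in MySQL high-availability environments. When a MySQL master fails, MHA can complete the failover automatically within 0 to 30 seconds, and while doing so it preserves data consistency as far as possible, which is what makes it high availability in a meaningful sense.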
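The software has two parts: MHA Manager (management node) and MHA Node (data node). MHA Manager can be deployed on a dedicated machine to manage several master-slave clusters, or it can run on one of the slaves. MHA Node runs on every MySQL server. MHA Manager probes the master of each cluster at regular intervals; when the master fails, it automatically promotes the slave with the most recent data to be the new master and repoints all other slaves to it. The failover is meant to be transparent to applications.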
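During automatic failover, MHA tries to save the binary logs from the crashed master so that as little data as possible is lost, but this is not always feasible. For example, if the master has a hardware failure or cannot be reached over SSH, MHA cannot save the binary logs; it fails over anyway and the most recent data is lost. Semi-synchronous replication, available since MySQL 5.5, greatly reduces this risk, and MHA can be combined with it: as long as at least one slave has received the latest binary log events, MHA can apply them to all other slaves and so keep every node consistent.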
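MHA currently targets a one-master, multiple-slave architecture. Building MHA requires at least three database servers in a replication cluster: one master, one standby master, and one slave. Because three servers is the minimum, Taobao modified MHA for cost reasons, and Taobao's TMHA now supports one master with a single slave.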
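1. About MHA
MHA is an open-source high-availability program for MySQL. It adds to a MySQL master-slave replication setup the ability to pick a new master from the slaves after the master goes down: when MHA detects that the master data node has failed, it promotes the slave holding the most recent data to be the new master. MHA consists of data nodes and a manager node; MHA Manager is usually deployed on a server of its own, and MHA Node usually runs on every MySQL server. A failover does two things: (1) it promotes one slave to master; (2) before promoting it, it merges the data recorded on all the other slaves into the slave that is about to become master.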
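2. MHA components
(1) Manager node
masterha_check_ssh: checks the SSH environment that MHA depends on
masterha_check_repl: checks the MySQL replication environment
masterha_manager: main MHA service program
masterha_check_status: reports the running state of MHA
masterha_master_monitor: checks whether the MySQL master is available
masterha_master_switch: switches the master
masterha_conf_host: adds or removes configured hosts
masterha_stop: stops the MHA service
(2) Node
save_binary_logs: saves and copies the master's binary logs
apply_diff_relay_logs: identifies differential relay log events and applies them to the other slaves
filter_mysqlbinlog: strips unnecessary ROLLBACK events
purge_relay_logs: purges relay logs (without blocking the SQL thread)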
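After installing the packages it can be worth checking that these tools actually landed on PATH. A minimal sketch; `missing_tools` is a hypothetical helper that takes the tool names as arguments, so nothing about MHA is hard-coded in it.

```shell
# Hypothetical helper: print every command name from the argument list that is
# not found on PATH; prints nothing when all of them are present.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || echo "$tool"
    done
}
```

On the manager one might run `missing_tools masterha_check_ssh masterha_check_repl masterha_manager masterha_check_status masterha_master_monitor masterha_master_switch masterha_conf_host masterha_stop`, and on each data node `missing_tools save_binary_logs apply_diff_relay_logs filter_mysqlbinlog purge_relay_logs`. Empty output means everything was found.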
One MHA Manager can manage several master-slave replication groups.
[root@server1 ~]# ls
anaconda-ks.cfg mysql-community-client-5.7.17-1.el6.x86_64.rpm
install.log mysql-community-common-5.7.17-1.el6.x86_64.rpm
install.log.syslog mysql-community-libs-5.7.17-1.el6.x86_64.rpm
MHA mysql-community-libs-compat-5.7.17-1.el6.x86_64.rpm
mha4mysql-node-0.56-0.el6.noarch.rpm mysql-community-server-5.7.17-1.el6.x86_64.rpm
[root@server1 ~]# cd MHA/
[root@server1 MHA]# ls
master_ip_failover
master_ip_online_change
mha4mysql-manager-0.56-0.el6.noarch.rpm
mha4mysql-node-0.56-0.el6.noarch.rpm
perl-Config-Tiny-2.12-7.1.el6.noarch.rpm
perl-Email-Date-Format-1.002-5.el6.noarch.rpm
perl-Log-Dispatch-2.27-1.el6.noarch.rpm
perl-Mail-Sender-0.8.16-3.el6.noarch.rpm
perl-Mail-Sendmail-0.79-12.el6.noarch.rpm
perl-MIME-Lite-3.027-2.el6.noarch.rpm
perl-MIME-Types-1.28-2.el6.noarch.rpm
perl-Parallel-ForkManager-0.7.9-1.el6.noarch.rpm
send_report
[root@server1 MHA]# yum install -y *.rpm
[root@server1 MHA]# scp mha4mysql-node-0.56-0.el6.noarch.rpm server2:
[root@server1 MHA]# scp mha4mysql-node-0.56-0.el6.noarch.rpm server3:
[root@server2 ~]# rpm -ivh mha4mysql-node-0.56-0.el6.noarch.rpm
Preparing... ########################################### [100%]
package mha4mysql-node-0.56-0.el6.noarch is already installed
[root@server3 ~]# rpm -ivh mha4mysql-node-0.56-0.el6.noarch.rpm
Preparing... ########################################### [100%]
package mha4mysql-node-0.56-0.el6.noarch is already installed
[root@server1 MHA]# mkdir -p /etc/masterha
[root@server1 MHA]# cd /etc/masterha/
[root@server1 masterha]# ls
[root@server1 masterha]# ls
[root@server1 masterha]# vim app1.cnf
[server default]
manager_workdir=/etc/masterha/
manager_log=/etc/masterha/app1.log
master_binlog_dir=/var/lib/mysql
#master_ip_failover_script= /usr/local/bin/master_ip_failover
#master_ip_online_change_script= /usr/local/bin/master_ip_online_change
password=YaoJing...123@
user=root
ping_interval=1
remote_workdir=/tmp
repl_password=YaoJing...123@
repl_user=repl
#report_script=/usr/local/send_report
#secondary_check_script= /usr/local/bin/masterha_secondary_check -s server03 -s server02
#shutdown_script=""
ssh_user=root
[server1]
hostname=172.25.53.1
port=3306
[server2]
hostname=172.25.53.2
port=3306
#candidate_master=1
#check_repl_delay=0
[server3]
hostname=172.25.53.3
port=3306
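The per-host sections at the end of app1.cnf all follow the same pattern, so with more hosts they can be generated rather than typed. A rough sketch, assuming every host listens on port 3306 as in this setup; `render_mha_hosts` is a made-up name and the output only covers the plain hostname/port lines, not options such as candidate_master.

```shell
# Hypothetical helper: print one [serverN] section per IP address given,
# numbered from 1, with the port fixed at 3306 (an assumption of this sketch).
render_mha_hosts() {
    n=1
    for ip in "$@"; do
        printf '[server%d]\nhostname=%s\nport=3306\n' "$n" "$ip"
        n=$((n + 1))
    done
}
```

`render_mha_hosts 172.25.53.1 172.25.53.2 172.25.53.3` prints the three sections shown above, which can be appended to the file after the [server default] block.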
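The parameters are explained below (the // lines are annotations, not part of the file):
[server default]
manager_workdir=/etc/masterha
// Working directory of the manager
manager_log=/etc/masterha/app1.log
// Log file of the manager
master_binlog_dir=/var/lib/mysql
// Where the master stores its binlogs, so that MHA can find them
user=root
// MySQL user used for monitoring (root)
password=YaoJing...123@
// Password of the root user
ping_interval=1
// Interval between ping packets sent to the master, in seconds (default 3); after three attempts with no reply, failover starts automatically
remote_workdir=/tmp
// Directory on the remote MySQL hosts where binlogs are saved during a switchover
repl_user=repl
// Replication user (repl)
repl_password=YaoJing...123@
// Password of the replication user
ssh_user=root
// Login name used for SSH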
[server1]
hostname=172.25.53.1
port=3306
[server2]
hostname=172.25.53.2
port=3306
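candidate_master=1
// Marks this host as a candidate master; after a failover this slave is promoted to master
check_repl_delay=0
// By default MHA will not choose a slave as the new master if it is more than 100 MB of relay logs behind the master; check_repl_delay=0 disables that check
[server3]
hostname=172.25.53.3
port=3306
#no_master=1 // This host will never become master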
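The ping_interval behaviour described above (probe, retry, and only fail over after three misses) can be pictured with a small retry loop. This is a simplified illustration only: MHA's real monitor is written in Perl and does more than this, and `master_looks_dead` is a hypothetical name.

```shell
# Simplified illustration, not MHA's implementation. Runs a health-check
# command up to $1 times, sleeping $2 seconds between tries. Returns 0
# (master considered dead) only if every attempt fails.
master_looks_dead() {
    attempts=$1
    interval=$2
    shift 2
    i=0
    while [ "$i" -lt "$attempts" ]; do
        if "$@" >/dev/null 2>&1; then
            return 1    # one successful probe is enough: master is alive
        fi
        i=$((i + 1))
        if [ "$i" -lt "$attempts" ]; then
            sleep "$interval"
        fi
    done
    return 0
}
```

With the values from this configuration that would be something like `master_looks_dead 3 1 mysqladmin -h 172.25.53.1 ping && echo "master is down"`; the choice of `mysqladmin ping` as the probe is an assumption of this example.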
Passwordless SSH
[root@server1 ~]# ssh-keygen
Generating public/private rsa key pair.
Enter file in which to save the key (/root/.ssh/id_rsa):
Enter passphrase (empty for no passphrase):
Enter same passphrase again:
Your identification has been saved in /root/.ssh/id_rsa.
Your public key has been saved in /root/.ssh/id_rsa.pub.
The key fingerprint is:
ed:60:3a:25:d7:c1:b2:94:cf:54:f3:95:b1:16:cb:f4 root@server1
The key's randomart image is:
+--[ RSA 2048]----+
| o ++|
| o . ooo=|
| + + .=E|
| . O . . |
| . S = |
| * o |
| o . |
| . |
| |
+-----------------+
[root@server1 ~]# ssh-copy-id server1
[root@server1 ~]# ssh-copy-id 172.25.53.1
[root@server1 ~]# scp -r .ssh/ server2:
[root@server1 ~]# scp -r .ssh/ server3:
[root@server1 MHA]# masterha_check_ssh --conf=/etc/masterha/app1.cnf
[root@server1 ~]# masterha_check_repl --conf=/etc/masterha/app1.cnf
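masterha_check_ssh needs key-based SSH to work between the nodes in every direction, not just from the manager outward, which is why the whole .ssh directory is copied to server2 and server3 above. A small sketch that lists the host pairs involved; `ssh_pairs` is a hypothetical helper and only prints the pairs, it does not connect anywhere.

```shell
# Hypothetical helper: print every ordered "source destination" pair of
# distinct hosts from the argument list, one pair per line.
ssh_pairs() {
    for src in "$@"; do
        for dst in "$@"; do
            [ "$src" = "$dst" ] || echo "$src $dst"
        done
    done
}
```

For the three hosts here, `ssh_pairs 172.25.53.1 172.25.53.2 172.25.53.3` gives six pairs. Each one could then be tried by hand, for example `ssh root@172.25.53.2 ssh root@172.25.53.3 true`, if masterha_check_ssh reports a failure and you want to see which direction is broken.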
Problem encountered:
Sat Aug 11 13:37:03 2018 - [error][/usr/share/perl5/vendor_perl/MHA/ServerManager.pm, ln301] Got MySQL error when connecting 172.25.53.3(172.25.53.3:3306) :1130:Host 'server1' is not allowed to connect to this MySQL server, but this is not a MySQL crash. Check MySQL server settings.
at /usr/share/perl5/vendor_perl/MHA/ServerManager.pm line 297
Solution:
Server1
mysql> grant all on *.* to root@'%' identified by 'YaoJing...123@';
Query OK, 0 rows affected, 1 warning (0.36 sec)
Server2
mysql> set global read_only=1;
Query OK, 0 rows affected (0.00 sec)
Try again:
[root@server1 ~]# masterha_check_repl --conf=/etc/masterha/app1.cnf
Problem encountered:
Sat Aug 11 13:41:15 2018 - [error][/usr/share/perl5/vendor_perl/MHA/Server.pm, ln393] 172.25.53.2(172.25.53.2:3306): User repl does not exist or does not have REPLICATION SLAVE privilege! Other slaves can not start replication from this host.
Solution:
Server1
mysql> grant replication slave on *.* to repl@'172.25.53.%' identified by 'YaoJing...123@';
Query OK, 0 rows affected, 1 warning (0.36 sec)
Check again:
[root@server1 ~]# masterha_check_repl --conf=/etc/masterha/app1.cnf
Success.
At this point the master is server1.
Manually switch the master to server2:
[root@server1 ~]# masterha_master_switch --conf=/etc/masterha/app1.cnf --master_state=alive --new_master_host=172.25.53.2 --new_master_port=3306 --orig_master_is_new_slave
The switchover did not succeed. The status on server1 is normal,
but on server3 the I/O thread shows No.
The error shown is:
Last_Errno: 1062
Last_Error: Could not execute Write_rows event on table mysql.plugin; Duplicate entry 'validate_password' for key 'PRIMARY', Error_code: 1062; handler error HA_ERR_FOUND_DUPP_KEY; the event's master log binlog.000002, end_log_pos 418
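Solution: run reset master on the master (server2),
then run reset slave on the slave with the I/O problem (server3).
The status returns to normal.
Create a table on the master;
server3 replicates it.
But now server1 is broken again: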
Last_IO_Errno: 1236
Last_IO_Error: Got fatal error 1236 from master when reading data from binary log: 'Slave has more GTIDs than the master has, using the master's SERVER_UUID. This may indicate that the end of the binary log was truncated or that the last binary log file was lost, e.g., after a power or disk failure when sync_binlog != 1. The master may or may not have rolled back transactions that were already replica'
Solution: on every slave, run:
mysql> stop slave;
mysql> reset master;
mysql> reset slave;
mysql> start slave;
Back to normal.
After that, creating tables and switching the master both work.
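Error 1236 above means the slave's executed GTID set was not contained in the master's. Before resorting to reset master on a slave (which throws away its GTID history and is only reasonable in a lab like this one), the situation can be checked with MySQL's `GTID_SUBSET()` function. A sketch that just builds the query; `gtid_subset_sql` and the column alias are made up for this example, and the master's GTID set has to be copied from `SELECT @@GLOBAL.gtid_executed` on the master.

```shell
# Hypothetical helper: print a query that, run on a slave, returns 1 when every
# GTID the slave has executed is also contained in the master's GTID set ($1),
# and 0 when the slave has GTIDs the master does not know about.
gtid_subset_sql() {
    printf "SELECT GTID_SUBSET(@@GLOBAL.gtid_executed, '%s') AS slave_within_master;\n" "$1"
}
```

The printed statement can be pasted into the mysql client on the slave, or piped into it.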
High-availability test
Start masterha_manager on server1:
[root@server1 MHA]# nohup masterha_manager --conf=/etc/masterha/app1.cnf --ignore_last_failover 2>&1
nohup: ignoring input and appending output to `nohup.out'
Shut down MySQL on server1 (the master).
Before shutting down:
[root@server1 ~]# /etc/init.d/mysqld stop
Stopping mysqld: [ OK ]
Check the status:
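After MySQL on server1 is restarted, it does not rejoin the cluster automatically, and it still considers itself a master.
It has to be reconfigured by hand to join the cluster as a slave:
mysql> change master to master_host='172.25.53.2',master_user='repl',master_password='YaoJing...123@',master_auto_position=1;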
Query OK, 0 rows affected, 2 warnings (0.56 sec)
mysql> start slave;
Query OK, 0 rows affected (0.03 sec)
mysql> show slave status\G
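Since the rejoin step has to be repeated after every failover, with a different master each time, the statement can be generated from parameters. A minimal sketch; `rejoin_sql` is a hypothetical helper, it only prints SQL, and it assumes GTID auto-positioning as used throughout this setup.

```shell
# Hypothetical helper: print the statements that point a restarted ex-master
# at the current master using GTID auto-positioning.
rejoin_sql() {
    # $1: current master host, $2: replication user, $3: replication password
    printf "CHANGE MASTER TO MASTER_HOST='%s', MASTER_USER='%s', MASTER_PASSWORD='%s', MASTER_AUTO_POSITION=1;\nSTART SLAVE;\n" "$1" "$2" "$3"
}
```

The host, user and password are whatever your replication account actually is; the output can be piped into the mysql client on the node that is rejoining.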
Manual switchover:
// Online (hot) switchover
masterha_master_switch --conf=/etc/masterha/app1.cnf --master_state=alive --new_master_host=172.25.53.2 --new_master_port=3306 --orig_master_is_new_slave --running_updates_limit=1000
(switches to server2)
// Offline (cold) switchover
masterha_master_switch --conf=/etc/masterha/app1.cnf --master_state=dead --dead_master_host=172.25.53.1 --dead_master_port=3306 --new_master_host=172.25.53.2 --new_master_port=3306 --ignore_last_failover
(switches from server1 to server2)
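The two command lines differ only in the state-specific options, so a small wrapper can keep them straight. A sketch under the assumptions of this setup (port 3306 everywhere, the option values shown above); `switch_cmd` is a hypothetical name and it only prints the command so that it can be reviewed before it is run.

```shell
# Hypothetical helper: print the masterha_master_switch command line for an
# online ("alive") or offline ("dead") switchover. Nothing is executed.
switch_cmd() {
    # $1: alive|dead, $2: config file, $3: old master IP, $4: new master IP
    case "$1" in
        alive)
            echo "masterha_master_switch --conf=$2 --master_state=alive --new_master_host=$4 --new_master_port=3306 --orig_master_is_new_slave --running_updates_limit=1000" ;;
        dead)
            echo "masterha_master_switch --conf=$2 --master_state=dead --dead_master_host=$3 --dead_master_port=3306 --new_master_host=$4 --new_master_port=3306 --ignore_last_failover" ;;
        *)
            echo "usage: switch_cmd alive|dead CONF OLD_MASTER NEW_MASTER" >&2
            return 1 ;;
    esac
}
```

For example, `switch_cmd dead /etc/masterha/app1.cnf 172.25.53.1 172.25.53.2` prints the offline variant for moving from server1 to server2.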