修改/etc/hosts和/etc/sysconfig/network-scripts之后,开启GPFS不能成功
[root@gpfsfpo1 network-scripts]# mmstartup -a
get file failed: Unable to reach any quorum node (Check your firewall or network settings) (err 823)
gpfsClusterInit: Unexpected error from ccr fget mmsdrfs. Return code: 149
mmstartup: Command failed. Examine previous error messages to determine cause
[root@gpfsfpo1 network-scripts]# mmlscluster
get file failed: Unable to reach any quorum node (Check your firewall or network settings) (err 823)
gpfsClusterInit: Unexpected error from ccr fget mmsdrfs. Return code: 149
mmlscluster: Command failed. Examine previous error messages to determine cause.
[root@gpfsfpo1 network-scripts]# mmchnode --daemon-interface=gpfsfpo-daemon1 -N gpfsfpo1
mmchnode: Unable to obtain the GPFS configuration file lock.
mmchnode: GPFS was unable to obtain a lock from node gpfsfpo1.
mmchnode: Command failed. Examine previous error messages to determine cause.
[root@gpfsfpo1 network-scripts]# mmchcluster --ccr-disable
mmchcluster: Unable to obtain the GPFS configuration file lock.
mmchcluster: GPFS was unable to obtain a lock from node gpfsfpo1.
mmchcluster: Processing continues without lock protection.
mmchcluster: Cluster Export Services are enabled on the following nodes:
gpfsfpo1
gpfsfpo2
Cluster Export Services require CCR to be enabled.
Disable CCR is not allowed without --force option.
mmchcluster: Command failed. Examine previous error messages to determine cause.
[root@gpfsfpo1 network-scripts]# mmchcluster --ccr-disable --force
mmchcluster: Unable to obtain the GPFS configuration file lock.
mmchcluster: GPFS was unable to obtain a lock from node gpfsfpo1.
mmchcluster: Processing continues without lock protection.
Verifying GPFS is stopped on all nodes ...
mmchcluster: GPFS cluster configuration servers:
mmchcluster: Primary server: gpfsfpo1
mmchcluster: Secondary server: gpfsfpo2
mmchcluster: Command successfully completed
[root@gpfsfpo1 network-scripts]# mmlscluster
GPFS cluster information
========================
GPFS cluster name: gpfs.FPO
GPFS cluster id: 5710580052883819165
GPFS UID domain: gpfs.FPO
Remote shell command: /usr/bin/ssh
Remote file copy command: /usr/bin/scp
Repository type: server-based
GPFS cluster configuration servers:
-----------------------------------
Primary server: gpfsfpo1
Secondary server: gpfsfpo2
Node Daemon node name IP address Admin node name Designation
---------------------------------------------------------------------
1 gpfsfpo-daemon1 192.168.4.142 gpfsfpo1 quorum-manager
2 gpfsfpo-daemon2 192.168.4.144 gpfsfpo2 quorum-manager
[root@gpfsfpo1 network-scripts]# mmchnode --daemon-interface=gpfsfpo-daemon1 -N gpfsfpo1
Tue Aug 15 22:08:27 EDT 2017: mmchnode: Processing node gpfsfpo1
Verifying GPFS is stopped on all nodes ...
gpfsfpo2: mmremote: Unable to determine the local node identity.
gpfsfpo2: mmremote: Command failed. Examine previous error messages to determine cause.
mmchnode: Propagating the cluster configuration data to all
affected nodes. This is an asynchronous process.
[root@gpfsfpo1 network-scripts]# mmchnode --daemon-interface=gpfsfpo-daemon2 -N gpfsfpo2
mmremote: Unable to determine the local node identity.
mmremote: Command failed. Examine previous error messages to determine cause.
mmremote: Unable to determine the local node identity.
mmremote: Command failed. Examine previous error messages to determine cause.
mmchnode: syncServerNodes: Unable to obtain mmsdrfs version line from backup server.
mmchnode: Command failed. Examine previous error messages to determine cause.
按照http://nixys.fr/blog/?p=2152的方法
强行修改vi /var/mmfs/gen/mmsdrfs里面的ip地址
[root@gpfsfpo1 network-scripts]# mmstartup -a
Tue Aug 15 22:15:16 EDT 2017: mmstartup: Starting GPFS ...
[root@gpfsfpo1 network-scripts]# mmgetstate -a
Node number Node name GPFS state
------------------------------------------
1 gpfsfpo-daemon1 arbitrating
2 gpfsfpo-daemon2 arbitrating
[root@gpfsfpo1 network-scripts]# mmgetstate -a
Node number Node name GPFS state
------------------------------------------
1 gpfsfpo-daemon1 active
2 gpfsfpo-daemon2 active
集群可以被拉起了