Trouble I ran into while setting up MHA ('A`)
[server default]
# mysql user and password
user=test
password=test
ssh_user=mysql
# working directory on the manager
manager_workdir=/var/log/masterha/hoge
# working directory on MySQL servers
remote_workdir=/var/log/masterha/hoge
# binary_log path
master_binlog_dir=/var/lib/mysql
# Master vip Failover
master_ip_failover_script=/etc/mha/master_ip_failover_hoge
[server1]
hostname=192.168.0.100
candidate_master=1
[server2]
hostname=192.168.0.101
candidate_master=1
[server3]
hostname=192.168.0.102
candidate_master=1
[server4]
hostname=192.168.0.103
candidate_master=1
## backup server
[server5]
hostname=192.168.0.104
no_master=1
[mysql@hoge-mha-manager01 ~]$ masterha_check_ssh --conf=/etc/hoge.cnf
Wed Jan 22 22:57:05 2014 - [warning] Global configuration file /etc/masterha_default.cnf not found. Skipping.
Wed Jan 22 22:57:05 2014 - [info] Reading application default configurations from /etc/hoge.cnf..
Wed Jan 22 22:57:05 2014 - [info] Reading server configurations from /etc/hoge.cnf..
Wed Jan 22 22:57:05 2014 - [info] Starting SSH connection tests..
Wed Jan 22 22:57:06 2014 - [debug]
Wed Jan 22 22:57:05 2014 - [debug] Connecting via SSH from mysql@192.168.0.100(192.168.0.100:22) to mysql@192.168.0.101(192.168.0.101:22)..
Wed Jan 22 22:57:05 2014 - [debug] ok.
(snip)
Wed Jan 22 22:57:08 2014 - [info] All SSH connection tests passed successfully.
[mysql@hoge-mha-manager01 ~]$ masterha_check_repl --conf=/etc/hoge.cnf
Wed Jan 22 22:59:19 2014 - [warning] Global configuration file /etc/masterha_default.cnf not found. Skipping.
Wed Jan 22 22:59:19 2014 - [info] Reading application default configurations from /etc/hoge.cnf..
Wed Jan 22 22:59:19 2014 - [info] Reading server configurations from /etc/hoge.cnf..
Wed Jan 22 22:59:19 2014 - [info] MHA::MasterMonitor version 0.55.
Wed Jan 22 22:59:19 2014 - [info] Dead Servers:
Wed Jan 22 22:59:19 2014 - [info] Alive Servers:
(snip)
Wed Jan 22 22:59:19 2014 - [info] Current Alive Master: 192.168.0.100(192.168.0.100:3306)
Wed Jan 22 22:59:19 2014 - [info] Checking slave configurations..
Wed Jan 22 22:59:19 2014 - [warning] relay_log_purge=0 is not set on slave 192.168.0.101(192.168.0.101:3306).
Wed Jan 22 22:59:19 2014 - [warning] relay_log_purge=0 is not set on slave 192.168.0.102(192.168.0.102:3306).
Wed Jan 22 22:59:19 2014 - [warning] relay_log_purge=0 is not set on slave 192.168.0.103(192.168.0.103:3306).
Wed Jan 22 22:59:19 2014 - [warning] relay_log_purge=0 is not set on slave 192.168.0.104(192.168.0.104:3306).
Wed Jan 22 22:59:19 2014 - [info] Checking replication filtering settings..
Wed Jan 22 22:59:19 2014 - [info] binlog_do_db= , binlog_ignore_db=
Wed Jan 22 22:59:19 2014 - [info] Replication filtering check ok.
Wed Jan 22 22:59:19 2014 - [info] Starting SSH connection tests..
Wed Jan 22 22:59:22 2014 - [error][/usr/share/perl5/vendor_perl/MHA/MasterMonitor.pm, ln386] Error happend on checking configurations. SSH Configuration Check Failed!
at /usr/share/perl5/vendor_perl/MHA/MasterMonitor.pm line 341
Wed Jan 22 22:59:22 2014 - [error][/usr/share/perl5/vendor_perl/MHA/MasterMonitor.pm, ln482] Error happened on monitoring servers.
Wed Jan 22 22:59:22 2014 - [info] Got exit code 1 (Not master dead).
MySQL Replication Health is NOT OK!
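masterha_check_ssh comes back OK, and an actual ssh login goes through without a password, yet masterha_check_repl yells "SSH Configuration Check Failed!" at me ('A`)
Still very much without an answer...
[Update 2014/01/22 23:14]
I found a page titled "MHA for MySQL の概要" (an overview of MHA for MySQL) at http://mizzy.org/blog/2013/02/06/1/ and realized that there is a command called masterha_manager.
(It is written right there in the init script, mind you...)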
[mysql@hoge-mha-manager01 ~]$ masterha_manager --conf=/etc/hoge.cnf
Wed Jan 22 23:13:34 2014 - [warning] Global configuration file /etc/masterha_default.cnf not found. Skipping.
Wed Jan 22 23:13:34 2014 - [info] Reading application default configurations from /etc/hoge.cnf..
Wed Jan 22 23:13:34 2014 - [info] Reading server configurations from /etc/hoge.cnf..
Wed Jan 22 23:13:34 2014 - [info] MHA::MasterMonitor version 0.55.
Wed Jan 22 23:13:34 2014 - [error][/usr/share/perl5/vendor_perl/MHA/MasterMonitor.pm, ln386] Error happend on checking configurations. Permission denied:/var/log/masterha/hoge/hoge.master_status.health at /usr/share/perl5/vendor_perl/MHA/MasterMonitor.pm line 294
Wed Jan 22 23:13:34 2014 - [error][/usr/share/perl5/vendor_perl/MHA/MasterMonitor.pm, ln482] Error happened on monitoring servers.
Wed Jan 22 23:13:34 2014 - [info] Got exit code 1 (Not master dead).
Wed Jan 22 23:13:34 2014 - [error][/usr/share/perl5/vendor_perl/MHA/ManagerUtil.pm, ln178] Got Error on finalize_on_error at monitor: Permission denied:/var/log/masterha/hoge/hoge.master_status.health at /usr/share/perl5/vendor_perl/MHA/MasterMonitor.pm line 591
# ll /var/log/masterha
d--------- 2 mysql mysql 4096 Jan 22 23:11 hoge
(・∀・)
( ꒪⌓꒪)
(´つω・`) *deflated*
Right. I had created the directory with Chef, so I never noticed, but the permissions in the recipe were wrong (´;ω;`) *bursts into tears*
Sorry, I'm just hopeless...
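In hindsight, a small pre-flight check on the manager host would have caught the `d---------` directory before I ran any MHA command. The sketch below is my own helper, not part of MHA: the name `check_workdir` is hypothetical, and it assumes the config contains a plain `manager_workdir=/path` line on its own. It reads that path and reports whether the current user can enter and write to it.

```shell
#!/bin/sh
# Hypothetical helper (not part of MHA): verify that the directory named by
# manager_workdir in an MHA config file is usable by the current user.
check_workdir() {
    conf="$1"
    # Take the first manager_workdir=... line and strip the key.
    dir=$(sed -n 's/^manager_workdir=//p' "$conf" | head -n 1)
    if [ -z "$dir" ]; then
        echo "manager_workdir not found in $conf"
        return 2
    fi
    # The manager must be able to enter the directory (x) and create files in it (w).
    if [ -d "$dir" ] && [ -w "$dir" ] && [ -x "$dir" ]; then
        echo "OK: $dir"
        return 0
    fi
    echo "NG: $dir is missing or not writable by $(id -un)"
    return 1
}
```

Run it as the user that starts the manager (here `mysql`), for example `check_workdir /etc/hoge.cnf`. If it prints NG, the fix belongs in the Chef recipe that creates the directory; which mode is appropriate depends on your environment.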
$ masterha_check_repl --conf=/etc/hoge.cnf
(snip)
Wed Jan 22 23:18:59 2014 - [info] Got exit code 0 (Not master dead).
MySQL Replication Health is OK.
It passed (´;ω;`) *bursts into tears*
Once I started the manager, I could also confirm the process was running, so everything was fine.
Checking MySQL-MHA
- Check SSH : OK
- Check MySQL-Replication : OK
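Output in this shape can be produced by a thin wrapper around the two check commands. The following is only an illustrative sketch, not the actual start script used here: `run_check` is a hypothetical name, and it assumes both MHA check commands exit non-zero on failure, as masterha_check_repl does in the logs above.

```shell
#!/bin/sh
# Illustrative sketch only: run_check is a hypothetical helper, not part of MHA.
# It runs the given command quietly and prints one "- Check <label> : OK/NG" line.
run_check() {
    label="$1"
    shift
    if "$@" >/dev/null 2>&1; then
        echo "- Check $label : OK"
    else
        echo "- Check $label : NG"
        return 1
    fi
}

# Example wiring, using the config path from this post:
#   echo "Checking MySQL-MHA"
#   run_check "SSH" masterha_check_ssh --conf=/etc/hoge.cnf
#   run_check "MySQL-Replication" masterha_check_repl --conf=/etc/hoge.cnf
```

Because the wrapper hides the command output, an NG line only tells you which check failed; rerun that command by hand (or masterha_manager, as above) to see the real error.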
Seriously hopeless. To think I spent two hours on this is just too dumb...