MHA構築時に困った('A`)


[server default]
# mysql user and password
user=test
password=test
ssh_user=mysql

# working directory on the manager
manager_workdir=/var/log/masterha/hoge

# working directory on MySQL servers
remote_workdir=/var/log/masterha/hoge

# binary_log path
master_binlog_dir=/var/lib/mysql

# Master vip Failover
master_ip_failover_script=/etc/mha/master_ip_failover_hoge

[server1]
hostname=192.168.0.100
candidate_master=1

[server2]
hostname=192.168.0.101
candidate_master=1

[server3]
hostname=192.168.0.102
candidate_master=1

[server4]
hostname=192.168.0.103
candidate_master=1
## backup server
[server5]
hostname=192.168.0.104
no_master=1



[mysql@hoge-mha-manager01 ~]$ masterha_check_ssh --conf=/etc/hoge.cnf
Wed Jan 22 22:57:05 2014 - [warning] Global configuration file /etc/masterha_default.cnf not found. Skipping.
Wed Jan 22 22:57:05 2014 - [info] Reading application default configurations from /etc/hoge.cnf..
Wed Jan 22 22:57:05 2014 - [info] Reading server configurations from /etc/hoge.cnf..
Wed Jan 22 22:57:05 2014 - [info] Starting SSH connection tests..
Wed Jan 22 22:57:06 2014 - [debug]
Wed Jan 22 22:57:05 2014 - [debug] Connecting via SSH from mysql@192.168.0.100(192.168.0.100:22) to mysql@192.168.0.101(192.168.0.101:22)..
Wed Jan 22 22:57:05 2014 - [debug] ok.
〜略〜
Wed Jan 22 22:57:08 2014 - [info] All SSH connection tests passed successfully.


[mysql@hoge-mha-manager01 ~]$ masterha_check_repl --conf=etc/hoge.cnf
Wed Jan 22 22:59:19 2014 - [warning] Global configuration file /etc/masterha_default.cnf not found. Skipping.
Wed Jan 22 22:59:19 2014 - [info] Reading application default configurations from /etc/hoge.cnf..
Wed Jan 22 22:59:19 2014 - [info] Reading server configurations from /etc/hoge.cnf..
Wed Jan 22 22:59:19 2014 - [info] MHA::MasterMonitor version 0.55.
Wed Jan 22 22:59:19 2014 - [info] Dead Servers:
Wed Jan 22 22:59:19 2014 - [info] Alive Servers:
〜略〜
Wed Jan 22 22:59:19 2014 - [info] Current Alive Master: 192.168.0.100(192.168.0.100:3306)
Wed Jan 22 22:59:19 2014 - [info] Checking slave configurations..
Wed Jan 22 22:59:19 2014 - [warning] relay_log_purge=0 is not set on slave 192.168.0.101(192.168.0.101:3306).
Wed Jan 22 22:59:19 2014 - [warning] relay_log_purge=0 is not set on slave 192.168.0.102(192.168.0.102:3306).
Wed Jan 22 22:59:19 2014 - [warning] relay_log_purge=0 is not set on slave 192.168.0.103(192.168.0.103:3306).
Wed Jan 22 22:59:19 2014 - [warning] relay_log_purge=0 is not set on slave 192.168.0.104(192.168.0.104:3306).
Wed Jan 22 22:59:19 2014 - [info] Checking replication filtering settings..
Wed Jan 22 22:59:19 2014 - [info] binlog_do_db= , binlog_ignore_db=
Wed Jan 22 22:59:19 2014 - [info] Replication filtering check ok.
Wed Jan 22 22:59:19 2014 - [info] Starting SSH connection tests..
Wed Jan 22 22:59:22 2014 - [error][/usr/share/perl5/vendor_perl/MHA/MasterMonitor.pm, ln386] Error happend on checking configurations. SSH Configuration Check Failed!
at /usr/share/perl5/vendor_perl/MHA/MasterMonitor.pm line 341
Wed Jan 22 22:59:22 2014 - [error][/usr/share/perl5/vendor_perl/MHA/MasterMonitor.pm, ln482] Error happened on monitoring servers.
Wed Jan 22 22:59:22 2014 - [info] Got exit code 1 (Not master dead).

MySQL Replication Health is NOT OK!

masterha_check_sshはOKになるし、実際にsshしてもノンパスで通るのにmasterha_check_replでSSH Configuration Check Failed!って怒られる('A`)
絶賛答えが見つからないところ・・・


【2014/01/22 23:14 追記】

http://mizzy.org/blog/2013/02/06/1/:MHA for MySQL の概要というページを見つけ、masterha_managerというコマンドがあると言うことに気がついた。
(起動スクリプトの中にがっつり書いてあるんですが・・・)


[mysql@hoge-mha-manager01 ~]$ masterha_manager --conf=/etc/hoge.cnf
Wed Jan 22 23:13:34 2014 - [warning] Global configuration file /etc/masterha_default.cnf not found. Skipping.
Wed Jan 22 23:13:34 2014 - [info] Reading application default configurations from /etc/hoge.cnf..
Wed Jan 22 23:13:34 2014 - [info] Reading server configurations from /etc/hoge.cnf..
Wed Jan 22 23:13:34 2014 - [info] MHA::MasterMonitor version 0.55.
Wed Jan 22 23:13:34 2014 - [error][/usr/share/perl5/vendor_perl/MHA/MasterMonitor.pm, ln386] Error happend on checking configurations. Permission denied:/var/log/masterha/hoge/hoge.master_status.health at /usr/share/perl5/vendor_perl/MHA/MasterMonitor.pm line 294
Wed Jan 22 23:13:34 2014 - [error][/usr/share/perl5/vendor_perl/MHA/MasterMonitor.pm, ln482] Error happened on monitoring servers.
Wed Jan 22 23:13:34 2014 - [info] Got exit code 1 (Not master dead).
Wed Jan 22 23:13:34 2014 - [error][/usr/share/perl5/vendor_perl/MHA/ManagerUtil.pm, ln178] Got Error on finalize_on_error at monitor: Permission denied:/var/log/masterha/hoge/hoge.master_status.health at /usr/share/perl5/vendor_perl/MHA/MasterMonitor.pm line 591

なんかログディレクトリでパーミッションがないって言ってる


# ll /var/log/masterha
d--------- 2 mysql mysql 4096 Jan 22 23:11 hoge

(・∀・)
( ꒪⌓꒪)
(´つω・`)シュン

はい、chefでディレクトリ掘ったので気がついてなかったのですが、recipeでパーミッション間違ってました(´;ω;`)ブワッ
すいません、ただのクズです・・・


$ masterha_check_repl --conf=/etc/hoge.cnf
〜略〜
Wed Jan 22 23:18:59 2014 - [info] Got exit code 0 (Not master dead).

MySQL Replication Health is OK.

通った(´;ω;`)ブワッ

スタートしたら


Checking MySQL-MHA
- Check SSH : OK
- Check MySQL-Replication : OK
プロセスも確認できたので大丈夫でした。

マジでクズです。これに2時間も時間かけたと思うとアホすぎる・・・