数据库

解决MySQL主从复制错误1595:Relay log write failure…

Jager · 3月16日 · 2019年 · · 6441次已读

先交代下背景:我这边正在研究基于 Docker 的 MariaDB Galera Cluster 的自动化集群部署方案【以下简称 MGC,后续有时间可以考虑分享下这个方案】,已经完成所有调试,想在生产环境部署一个 MGC 集群作为从先灰度测试。解决MySQL主从复制错误1595:Relay log write failure...

生产环境主 DB 版本为 MySQL5.5,新的 MGC 采用 Mariadb 最新 10.3.12 stable 版本,做好 MGC 集群,并导入一份从主 DBdump 出来的完整 SQL 之后,change master 开始创建主从,结果如下报错:

MariaDB [(none)]> show slave status\G                                                                                                                                                      
*************************** 1. row ***************************
                Slave_IO_State: 
                   Master_Host: x.x.x.x
                   Master_User: rpl
                   Master_Port: 3306
                 Connect_Retry: 60
               Master_Log_File: mysql-bin.001411
           Read_Master_Log_Pos: 360808406
                Relay_Log_File: mysql-relay-bin.000001
                 Relay_Log_Pos: 4
         Relay_Master_Log_File: mysql-bin.001411
              Slave_IO_Running: No
             Slave_SQL_Running: Yes
               Replicate_Do_DB: 
           Replicate_Ignore_DB: 
            Replicate_Do_Table: 
        Replicate_Ignore_Table: 
       Replicate_Wild_Do_Table: 
   Replicate_Wild_Ignore_Table: 
                    Last_Errno: 0
                    Last_Error: 
                  Skip_Counter: 0
           Exec_Master_Log_Pos: 360808406
               Relay_Log_Space: 256
               Until_Condition: None
                Until_Log_File: 
                 Until_Log_Pos: 0
            Master_SSL_Allowed: No
            Master_SSL_CA_File: 
            Master_SSL_CA_Path: 
               Master_SSL_Cert: 
             Master_SSL_Cipher: 
                Master_SSL_Key: 
         Seconds_Behind_Master: NULL
 Master_SSL_Verify_Server_Cert: No
                 Last_IO_Errno: 1595
                 Last_IO_Error: Relay log write failure: could not queue event from master
                Last_SQL_Errno: 0
                Last_SQL_Error: 
   Replicate_Ignore_Server_Ids: 
              Master_Server_Id: 15410
                Master_SSL_Crl: 
            Master_SSL_Crlpath: 
                    Using_Gtid: No
                   Gtid_IO_Pos: 
       Replicate_Do_Domain_Ids: 
   Replicate_Ignore_Domain_Ids: 
                 Parallel_Mode: conservative
                     SQL_Delay: 0
           SQL_Remaining_Delay: NULL
       Slave_SQL_Running_State: Slave has read all relay log; waiting for the slave I/O thread to update it
              Slave_DDL_Groups: 0
Slave_Non_Transactional_Groups: 0
    Slave_Transactional_Groups: 0
1 row in set (0.00 sec)

查了下资料,都是是磁盘满导致的,但是我这边磁盘空间、权限都没问题,于是继续检查了下 MGC 节点日志如下:

2019-01-28 10:47:26 11 [Note] Slave SQL thread initialized, starting replication in log 'mysql-bin.001411' at position 512281317, relay log './mysql-relay-bin.000003' position: 151473377
2019-01-28 10:47:26 10 [Note] Slave I/O thread: connected to master '[email protected]:3306',replication started in log 'mysql-bin.001413' at position 412624945
2019-01-28 10:47:26 10 [Warning] Slave I/O: Notifying master by SET @master_binlog_checksum= @@global.binlog_checksum failed with error: Unknown system variable 'binlog_checksum', Internal MariaDB error code: 1193
2019-01-28 10:47:26 10 [ERROR] Slave I/O: Replication event checksum verification failed while reading from network, Internal MariaDB error code: 1743
2019-01-28 10:47:26 10 [ERROR] Slave I/O: Relay log write failure: could not queue event from master, Internal MariaDB error code: 1595
2019-01-28 10:47:26 10 [Note] Slave I/O thread exiting, read up to log 'mysql-bin.001413', position 412624945

里面有一个关键信息:binlog_checksum failed,看来报错和这个有关系了。查了下资料,应该是 Mariadb 默认打开了 slave_sql_verify_checksum(MySQL 版本默认关闭),而主从版本又不一致,导致 checksum 失败。

解决办法也很简单,在 MGC 各节点的[msyqld]配置中加上:slave_sql_verify_checksum=0 ,关闭这个特性即可,具体有没有负面影响暂未深入研究。

11 条回应
  1. 老部落运维 2019-3-18 · 14:41

    想与贵站交换首页链接,不知可否?
    我站:https://www.laobuluo.com/
    站名:老部落运维

  2. 春色集 2019-3-18 · 17:31

    学习了感谢博主

  3. hack520.com 2019-3-18 · 23:59

    最新也在学习 Docker,这东西功能很强大!

  4. 锋言锋语 2019-3-20 · 15:58

    大神的博客文章我都看不懂,悲哀!

  5. 小龙 2019-5-7 · 14:15

    zhangge请教这个评论插件叫啥?还有你越来越懒了~文章好敷衍啊。hh

    • avatar
      Jager 2019-5-17 · 9:26

      1、没插件,自己折腾代码。
      2、写的详细就说啰嗦,写得简洁就说敷衍,你让我情何以堪?

  6. 吴先森 2019-6-15 · 23:38

    写的不错,不过想问个问题,,,为什么评论的昵称等等数据不能自动保存啊,,每次想评论都要重新填一次,有点小麻烦,,还有,表情很老旧了,建议一换

    • avatar
      Jager 2019-6-16 · 23:20

      怕是你的浏览器开了无痕模式,我测试是没啥问题;表情有空再说

      • 吴先森 2019-6-22 · 16:43

        发现真的是在无痕标签下打开的,尴尬,

        • 吴先森 2019-6-22 · 16:45

          呸,浏览器一直在无痕模式下,,,,以前用过一次结果忘关了,

  7. Czd926 2021-5-24 · 18:04

    (none)> show slave status\G;
    *************************** 1. row ***************************
    Slave_IO_State:
    Master_Host: 10.0.0.2
    Master_User: repl
    Master_Port: 6320
    Connect_Retry: 60
    Master_Log_File: mysql-bin.043224
    Read_Master_Log_Pos: 79696147
    Relay_Log_File: mysql-relay-bin.018076
    Relay_Log_Pos: 95557340
    Relay_Master_Log_File: mysql-bin.039282
    Slave_IO_Running: No
    Slave_SQL_Running: Yes
    Replicate_Do_DB:
    Replicate_Ignore_DB: mysql
    Replicate_Do_Table:
    Replicate_Ignore_Table:
    Replicate_Wild_Do_Table:
    Replicate_Wild_Ignore_Table: mysql.%
    Last_Errno: 0
    Last_Error:
    Skip_Counter: 0
    Exec_Master_Log_Pos: 95557127
    Relay_Log_Space: 413880030478
    Until_Condition: None
    Until_Log_File:
    Until_Log_Pos: 0
    Master_SSL_Allowed: No
    Master_SSL_CA_File:
    Master_SSL_CA_Path:
    Master_SSL_Cert:
    Master_SSL_Cipher:
    Master_SSL_Key:
    Seconds_Behind_Master: 178016
    Master_SSL_Verify_Server_Cert: No
    Last_IO_Errno: 1595
    Last_IO_Error: Relay log write failure: could not queue event from master
    Last_SQL_Errno: 0
    Last_SQL_Error:
    Replicate_Ignore_Server_Ids:
    Master_Server_Id: 16548605
    Master_UUID: 14dd4adb-7d8f-11eb-a69c-00163e349260
    Master_Info_File: mysql.slave_master_info
    SQL_Delay: 0
    SQL_Remaining_Delay: NULL
    Slave_SQL_Running_State: Reading event from the relay log
    Master_Retry_Count: 86400
    Master_Bind:
    Last_IO_Error_Timestamp: 210524 17:59:44
    Last_SQL_Error_Timestamp:
    Master_SSL_Crl:
    Master_SSL_Crlpath:
    Retrieved_Gtid_Set: 14dd4adb-7d8f-11eb-a69c-00163e349260:977755372-1150597417
    Executed_Gtid_Set: 12420282-f8c5-11ea-a11f-b4055d249664:1-28713249,
    14dd4adb-7d8f-11eb-a69c-00163e349260:1-1074140197:1150597417-1150618144,
    46fee754-80ce-11eb-9198-00163e145841:1-214,
    5bf6c310-f8c5-11ea-b936-b4055d240812:1-2293
    Auto_Position: 1
    Replicate_Rewrite_DB:
    Channel_Name:
    Master_TLS_Version:
    1 row in set (0.00 sec)

    ERROR:
    No query specified

    文章里面提到的是log_file+log_pos方式进行主从同步,我当前环境是启用了gtid同步方式,mysql.57 按照博主提供的方式,问题依然存在,Slave_IO_Running依然是no,目前还在消化relaylog,并没有从Master获取新的binlog