arbitration_service_unavailable

2024-08-23 10:14:19  Updated

Description

This alert is triggered when the arbitration service is unavailable.

Principle

Parameter Value
Metric arbitration_service_available
Source OCP-Agent checks whether the local observer process is started and whether the arbitration service port is occupied.
Collected metric arbitration_service_available
Metric expression sum(arbitration_service_available{@LABELS}) by (@GBLABELS)
Collection cycle 30 seconds

Alert rule

Metric expression Default threshold Duration Detection cycle Time before clearance
arbitration_service_available 0 0 seconds 30 seconds 5 minutes

Alert information

Trigger method Alert level Scope
Based on the expression of the metric Critical Arbitration service

Alert templates

  • Overview
    • Template: ${alarm_target} ${alarm_name}
    • Example: svr_ip=xxx.xxx.xxx.xxx:svr_port=2882 Arbitration service unavailable
  • Details
    • Template: Cluster: The arbitration service ${svr_ip}:${svr_port} is unavailable.
    • Example: Cluster: The arbitration service xxx.xxx.xxx.xxx:2882 is unavailable.

Impact on the system

If the arbitration service is unavailable, service reliability and high availability of the tenant are affected. If a tenant replica fails, the tenant cannot normally provide the arbitration service.

Possible causes

  • The server hosting the arbitration service is down.
  • The port of the arbitration service cannot be accessed externally because the firewall is enabled.

Solutions

  1. If the observer process of the server hosting the arbitration service unexpectedly exits, reload the observer process. You can execute the alert clearance plan to handle the alerted issue. For more information, see Execute the alert clearance plan.

    Notice

    You can reload the observer process only once within 12 hours since the event occurs.

  2. Check whether the server hosting the arbitration service runs normally. If the server fails, restart the server and restart the arbitration service in the OceanBase Cloud Platform (OCP) console.

  3. If the server is running normally, check whether the arbitration service process exists, whether the arbitration service port is listening normally, and whether the firewall prohibits access from other IP addresses.

Contact Us