obproxyd_process_stop

2024-03-22 09:34:34  Updated

Description

This alert is triggered when obproxyd.sh stops. obproxyd.sh is the daemon process for OBProxy.

The ocp_agent process monitors the uptime of the processes of components in OceanBase Database. When the uptime of a process is 0, the process is stopped. This alert indicates that the process has stopped at the moment rather than for a long time. In the latter case, other status alerts are triggered.

Relevant alerts:

  • observer process: observer_process_stop
  • obproxy process: obproxy_process_stop
  • obproxyd.sh process: obproxyd_process_stop
  • ocp_agentd process: agentd_process_stop
  • ocp_mgragent process: mgragent_process_stop
  • ocp_monagent process: monagent_process_stop

Principle

Parameter Value
Metric obproxyd_uptime_delta_seconds
Source The difference between the current time and the process creation time displayed in the 14th column of the stat file in the /proc/[pid] directory.
Collected metric process_uptime_seconds
Metric expression 0 - min(delta(process_uptime_seconds{name="obproxyd.sh",@LABELS}[@INTERVAL])) by (@GBLABELS)
Collection cycle 5 seconds

Alert rule

Metric expression Metric description Default threshold Detection cycle Time before clearance
obproxyd_uptime_delta_seconds > 0 The negative offset value of uptime is used. A metric value greater than 0 indicates that the process has stopped. 0 seconds 10 seconds 5 minutes

Alert information

Trigger method Alert level Scope
Based on the expression of the metric Warning OBProxy

Alert templates

  • Overview

    • Template: ${alarm_target} ${alarm_name}
    • Example: alarm_template_id=0:obproxy_cluster=ODPT2-2000005:host=xxx.xxx.xxx.xxx. The obproxyd.sh process stopped.
  • Details

    • Template: OBProxy cluster: ${obproxy_cluster}. Host: ${host}. Alert: ${alarm_name}. The process has been running for ${value_shown}.
    • Example: OBProxy cluster: ODPT2. Host: xxx.xxx.xxx.xxx. Alert: The obproxyd.sh process stopped. The process has been running for 1 day 15 hours 12 minutes 51 seconds.

Impact on the system

When the obproxy process stops or unexpectedly exits, it is restarted by the obproxyd.sh process. If the obproxyd.sh process cannot start, the obproxy process cannot be restarted. This results in prolonged service interruption.

You must ensure that OBProxy stays online as much as possible and provides services to minimize service interruption.

Possible causes

None.

Solutions

  1. Manually load the obproxyd.sh process. The restart of OBProxy causes business interruptions. Therefore, we recommend that you do not restart OBProxy in the OceanBase Cloud Platform (OCP) console.

    cd /opt/taobao/install/obproxy-3.4.0 && bin/obproxyd.sh -c start -r /home/admin/logs/obproxy -n ODPT2 -p 2883
    

Contact Us