ob_host_install_disk_percent_over_threshold

2023-08-22 02:46:05  Updated

Description

This alert is triggered when the usage of the disk mounted to the OBServer installation directory exceeds the threshold.

Note

The default installation directory of OBServer is/home/admin/oceanbase.

Principle

The following table describes the key parameters that are involved in the monitoring and alerting logic.

Parameter Value
Metric ob_host_install_path_disk_percent Note This metric indicates the disk usage in the OBServer installation directory. When the usage exceeds the threshold, this alert is triggered.
Source df -B1
Note
The metric source of this alert is special. OCP-Agent uses the preceding command to check the disk usage of OBServer and adds labels to disks. The labels vary with the disks:
  • Disk of the installation directory: mount_lable="install_path"
  • Disk of the data directory: mount_lable="data_path"
  • Disk of the log directory: mount_lable="log_path"
  • Collected metrics host_partition_volume_free and host_partition_volume_total
    Metric expression 100 * (1 - avg(host_partition_volume_free{@LABELS}) by (@GBLABELS) / avg(host_partition_volume_total{@LABELS}) by (@GBLABELS))
    Collection cycle 1 second

    Alert rule

    Metric Default threshold (unit: %) Duration Detection cycle Time before clearance
    ob_host_install_path_disk_percent 95 0 seconds 60 seconds 5 minutes

    Alert information

    Trigger method Alert level Scope
    Metric expression Critical Server

    Alert templates

    • Overview: ${alarm_target} ${alarm_name}

    • Details: Cluster: ${ob_cluster_name}, Host:${host}, Alert: ${alarm_name}. The disk usage in the installation directory ${install_path} of the ${mount_point} mount point is ${value}%, exceeding the threshold of ${alarm_threshold}%.

    • Overview example: ob_cluster=C1-1000:svr_ip=xxx.xxx.xxx.xxx. The disk usage in the OBServer installation directory exceeds the threshold.

    • Details example: Cluster: ob_cluster=C1-1000, Host:xxx.xxx.xxx.xxx, Alert: The disk usage in the OBServer installation directory exceeds the threshold. The disk usage in the installation directory /home/admin/oceanbase of the /home mount point is 98.0%, exceeding the threshold of 95.0%.

    Impact on the system

    The OBServer cannot write log files when the /home directory is full.

    Possible cause

    Too many files are generated by other applications.

    Suggested solution

    1. Run the df command to check whether the disk usage of the /home/ directory exceeds the threshold.

    2. If the disk usage exceeds the threshold, run the following command to find the directories and files that occupy the most space:

      # Find the five directories that occupy the most space.
      du -a /home/admin | sort -n -r | head -n 5
      
      # Find the five largest files.
      cd /home/admin && find -type f -exec du -Sh {} + | sort -rh | head -n 5
      
    3. Delete unwanted files.

    Contact Us