Description
This alert is triggered when the OceanBase cluster has not been successfully backed up for a long period.
Principle
OCP-Server runs a scheduled task every minute to check the data backup tasks, which include both logical and physical backup tasks. The alert is triggered when the last successful data backup task finished n days ago and n is greater than the threshold.
Note
The default threshold is 9 days and is specified by the parameter ocp.backup.alarm.base-backup-last-finished-threshold. You can also specify the threshold by modifying the value of Alert Period for Failed Data Backups in the Configure Alert Threshold section of the backup strategy.
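The check described above can be sketched as follows. This is a minimal illustration, not the OCP-Server implementation: the function name `backup_alert_due`, the constant `DEFAULT_THRESHOLD_DAYS`, and the choice to compare elapsed time as a duration rather than as a count of whole days are all assumptions.

```python
from datetime import datetime, timedelta

# Default taken from the text above (9 days). The names in this sketch are
# hypothetical and are not OCP identifiers.
DEFAULT_THRESHOLD_DAYS = 9


def backup_alert_due(
    last_success: datetime,
    now: datetime,
    threshold_days: int = DEFAULT_THRESHOLD_DAYS,
) -> bool:
    """Return True when the last successful backup is older than the threshold.

    Assumption: elapsed time is compared as a duration, so 9 days and
    23 hours counts as "greater than 9 days".
    """
    return now - last_success > timedelta(days=threshold_days)


if __name__ == "__main__":
    last_success = datetime(2020, 1, 5, 2, 0, 0)
    now = datetime(2020, 1, 15, 1, 0, 0)
    print(backup_alert_due(last_success, now))
```

A real check would run on the one-minute cycle and read the threshold from the backup strategy or from `ocp.backup.alarm.base-backup-last-finished-threshold`; the sketch only covers the comparison itself.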
Alert rule
| Metric | Default threshold | Duration | Detection cycle | Time before clearance |
|---|---|---|---|---|
| None | 9 days | 0 seconds | 60 seconds | 5 minutes |
Alert information
| Trigger method | Alert level | Scope |
|---|---|---|
| Reminder of OCP | Critical | Service |
Alert templates
Overview: ${alarm_target} ${alarm_name}
Details: ${alarm_description}
Overview example: ob_cluster=C1-1000. The cluster has not been successfully backed up for a long period.
Details example: obCluster: C1-1000, no successful data backup in 9 days, last successful time: 2020-01-05 02:00:00, current time: 2020-01-15 01:00:00, service ip: 127.0.0.1, error message:
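To illustrate how the `${...}` placeholders in the templates are filled, here is a minimal sketch that uses Python's `string.Template`, which happens to accept the same placeholder syntax. It is not how OCP renders alerts. The values are illustrative, and how the example text divides between `alarm_target` and `alarm_name` is a guess.

```python
from string import Template

# Templates copied from the alert templates above.
OVERVIEW = Template("${alarm_target} ${alarm_name}")
DETAILS = Template("${alarm_description}")

# Illustrative values only; the split between variables is an assumption.
values = {
    "alarm_target": "ob_cluster=C1-1000",
    "alarm_name": "The cluster has not been successfully backed up for a long period.",
    "alarm_description": "obCluster: C1-1000, no successful data backup in 9 days",
}

# substitute() raises KeyError if a placeholder has no value,
# which makes a missing variable easy to spot.
overview_text = OVERVIEW.substitute(values)
details_text = DETAILS.substitute(values)

print(overview_text)
print(details_text)
```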
Impact on the system
Data recovery may be slow or may even fail.
Possible causes
The last manual backup task failed.
The scheduled backup task failed.
Suggested solutions
Failure of manual backup tasks.
View the errors reported by the backup tasks and, if necessary, check the logs to identify the cause.
Log on to the OCP console. In the Clusters list on the Cluster Overview page, click the name of the target cluster.
In the left-side navigation pane, click Backup & Recovery.
Check for recently failed data backup tasks.
For a failed task, click View Cause in the Actions column to view the cause.
If no data backup task has been performed recently, go on to check the scheduled backup tasks.
Failure of scheduled backup tasks.
Log on to the OCP console. In the Clusters list on the Cluster Overview page, click the name of the target cluster.
In the left-side navigation pane, click Backup & Recovery.
On the Backup & Recovery page, click View next to Backup Scheduling History to check for recently failed tasks. You can view the task log to identify the cause.