products:promonitor:6.7:monitorsguide:hana:hanareplicationstatus

HANA replication status

Purpose

System replication in HANA is a key component to be resilient against hardware failures or data corruption. This monitor will check that the replication system is in the expected state, during its nominal and transition phases (init,sync). It will also check for reconnects and fail overs and notifies as soon as those situations are detected.

Configuration hints

To start monitoring replication, create a new rule in the surveillance table. You can filter on the host and port number of the instance to customize the monitoring for specific nodes.

Expected mode: Define the expected mode of the replication, sends alarm if different

Error severity: Check for the replication status as visible in the system and send an alarm if in error

Fully recoverable: Check for the fully recoverable status as visible in the system and send an alarm if not as expected

Max init/sync/unknown time Check for the time spent in the corresponding state and send an alarm if the replication systems stays in that state for too long.

Reconnect/Failover count Check for reconnect or fail over situations. If the delta between now and last check is over threshold, send an alarm.

Surveillance table

ParameterDescription
ActiveTo enable/disable a rule.
HostA filter on the host name of the instance
PortA filter on the port number of the instance
Expected modeDefine the expected mode of the replication
Error SeverityIf not DISABLED, send an alarm if the replication status is in error.
Fully recoverable Severityif not DISABLED, send an alarm if the fully recoverable state is not true.
Max init timeThe maximum time spent in init state
Max sync timeThe maximum time spent in sync state
Max UNKNOWN timeThe maximum time spent in unknown state
Max reconnect countThe maximum reconnects occurred since last check
Max failover countThe maximum failovers occurred since last check
Auto clearIf checked, the alarm will be cleared as soon as the alarm condition is not met anymore.
Alarm tagThis field allows to add custom text within the alarm message. %MSG% variable will containthe actual generated message and can be used such as: “my_prefix %MSG% my_suffix”. By default, tagwill be used as prefix.
enable AlarmTo enable alarm sending
enable QOSTo enable metric sending

Generated metrics

metricIdmetricUnitmetricTargetmetricDescription
REPLICATION_STATUSBooleanSITE/HOST/PORTSend TRUE if the replication status is OK, FALSE instead
REPLICATION_IS_SECONDARY_ACTIVEBooleanSITE/HOST/PORT
REPLICATION_IS_SECONDARY_FULLY_RECOVERABLEBooleanSITE/HOST/PORTSend TRUE if the fully recoverable status is TRUE, FALSE instead
REPLICATION_RECONNECT_COUNTReconnectsSITE/HOST/PORTSends the number of reconnects since last check
REPLICATION_FAILOVER_COUNTFailoversSITE/HOST/PORTSends the number of failovers since last check
/home/clients/8c48b436badcd3a0bdaaba8c59a54bf1/wiki-web/data/pages/products/promonitor/6.7/monitorsguide/hana/hanareplicationstatus.txt · Last modified: 2019/02/11 15:26 (external edit)