domain | shortname | name | status | editor |
---|---|---|---|---|
github.com |
13/HALC |
LDR1 HA, Loosely Coupled |
draft |
Valery V. Vorotyntsev <[email protected]> |
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119.
Sequence of events:
- s3server gets an I/O error.
- s3server puts an IEM into the syslog.
- SSPL is monitoring the syslog. It notices an IEM in the syslog, generates an alert message and passes it to RabbitMQ.
- CSM fetches the alert from RabbitMQ.
- CSM invokes the Decision Maker (DM).
- DM filters out unimportant alerts.
- DM applies "rules": if the alert matches predefined criteria, a corresponding action is taken.
- DM's action is to invoke the "HA tool" (
hctl
). - HA tool invokes Pacemaker (e.g.,
sudo pcs restart s3server-resource-name
).