Incident response
Every layer has a documented failure mode and a documented response. The principle: fail closed. No payment moves without a valid signed verdict.
Sanctions feed outage
Trigger: one of the six feeds returns stale data or 5xx for more than ten minutes.
Response:
- Veris hot path continues with remaining sources.
- WARN alert fires on the affected feed.
- If two feeds go down, all
risk_tierdecisions tighten one level (Medium becomes High, etc.) until restoration. - If three or more feeds go down, the engine fails closed: all attestations return
Blocked.
L3 Veris service down
Trigger: gRPC ping fails for more than thirty seconds.
Response:
- Pre-fail switch the engine to read-only mode (no new attestations).
- Pending payments queue with the verifier for retry.
- Operators run the Veris service restart playbook.
- Cached attestations within 60-second TTL continue to satisfy in-flight bundles.
L6 verifier downtime
Trigger: /verify/health returns 5xx for more than fifteen seconds.
Response:
- Networks switch to cached verifier pubkey + offline verification (see offline verification).
- The Solidity reference verifier on Base remains available.
- Operators restart the HTTP service.
Mass revocation event
Trigger: more than 100 Tier 1 revocations in a single flush cycle.
Response:
- Continue normal flush cadence (do not skip).
- CRIT alert fires.
- Operations reviews the trigger source (OFAC update, fraud incident, internal action).
- Communicate to subscribed networks via webhook.
Signing key compromise
Trigger: any signal that an MPC share has leaked.
Response:
- Immediate rotation per key rotation procedure.
- The old key continues to verify already-signed bundles within the one-hour overlap window.
- CRIT alert + post-mortem mandatory.
- Audit log entries marked for regulator notification.
On-chain anchor lag
Trigger: more than ninety minutes since the last anchor commit.
Response:
- WARN alert.
- Operators verify Base node connectivity.
- If the commit transaction failed, re-submit with higher gas.
- The audit chain itself is unaffected; only the on-chain confirmation lags.
Where to go next
- Monitoring for the probe surface.
- Key rotation for scheduled rotation.
- Deployment for the layer-by-layer architecture.