The Monitoring page is where you watch for trouble and track recovery. It is organized into three tabs — Incidents, Recoveries, and Rules — with a summary strip across the top.
The strip at the top of the page answers three questions at a glance:
The Incidents tab lists automation failures. Use the filter chips to narrow the view:
Click a row to expand it. An expanded incident shows:
The Recoveries tab is a chronological timeline of every recovery action, grouped by day. Filter by AI actions, Auto-reruns, or Resolved, and export the view to CSV for reporting.
Each entry records the original automation, the attempt number, the outcome, and how long recovery took.
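If you post-process the exported data, it can help to picture each timeline entry as a small record. The sketch below is illustrative only: the `RecoveryEntry` fields and the CSV column names are assumptions, not the actual Nimbus export layout. It shows entries grouped by day, as on the timeline, and serialized to CSV.

```python
import csv
import io
from collections import defaultdict
from dataclasses import dataclass
from datetime import datetime


@dataclass
class RecoveryEntry:
    # Hypothetical record shape; field names are illustrative, not the Nimbus schema.
    automation: str          # the original automation that failed
    attempt: int             # attempt number of this recovery
    outcome: str             # e.g. "resolved" or "failed" (assumed labels)
    duration_seconds: float  # how long recovery took
    finished_at: datetime    # when the recovery action completed


def group_by_day(entries):
    """Return {date: [entries]} with each day's entries in chronological order."""
    days = defaultdict(list)
    for entry in sorted(entries, key=lambda e: e.finished_at):
        days[entry.finished_at.date()].append(entry)
    return dict(days)


def to_csv(entries):
    """Serialize entries to CSV text, one row per recovery action."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["day", "automation", "attempt", "outcome", "duration_seconds"])
    for entry in sorted(entries, key=lambda e: e.finished_at):
        writer.writerow([
            entry.finished_at.date().isoformat(),
            entry.automation,
            entry.attempt,
            entry.outcome,
            entry.duration_seconds,
        ])
    return buffer.getvalue()


# Made-up sample data for illustration.
entries = [
    RecoveryEntry("Nightly import", 2, "resolved", 95.0, datetime(2024, 5, 2, 9, 30)),
    RecoveryEntry("Nightly import", 1, "failed", 40.0, datetime(2024, 5, 1, 23, 10)),
    RecoveryEntry("Welcome journey", 1, "resolved", 12.5, datetime(2024, 5, 2, 8, 0)),
]
grouped = group_by_day(entries)
csv_text = to_csv(entries)
```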
The Rules tab shows the monitoring rules protecting this Business Unit as a grid of cards. Each card shows the rule type (automation or folder), whether AI recovery is on, and three stats: times triggered in the last 7 days, success rate, and mean time to recovery (MTTR). To add a rule, click the dashed Add rule card.
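As a rough guide to reading the card stats, the sketch below shows one plausible way such numbers could be derived. It rests on assumptions this page does not confirm: that all three stats use the same 7-day window, that success rate is successful recoveries divided by triggers, and that MTTR is averaged over successful recoveries only. The `Trigger` record and `card_stats` function are hypothetical names.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass
class Trigger:
    # Hypothetical record of one rule firing; not the Nimbus data model.
    fired_at: datetime
    recovered: bool
    recovery_seconds: Optional[float]  # None when recovery did not succeed


def card_stats(triggers, now, window_days=7):
    """Compute the three card stats over the trailing window (assumed definitions)."""
    cutoff = now - timedelta(days=window_days)
    recent = [t for t in triggers if t.fired_at >= cutoff]
    successes = [t for t in recent if t.recovered]
    times = [t.recovery_seconds for t in successes if t.recovery_seconds is not None]
    return {
        "triggered": len(recent),
        # None rather than 0 when there is nothing to measure.
        "success_rate": len(successes) / len(recent) if recent else None,
        "mttr_seconds": sum(times) / len(times) if times else None,
    }


# Made-up sample data: three triggers inside the window, one outside it.
now = datetime(2024, 5, 8, 12, 0)
triggers = [
    Trigger(datetime(2024, 5, 7, 10, 0), True, 60.0),
    Trigger(datetime(2024, 5, 5, 10, 0), True, 120.0),
    Trigger(datetime(2024, 5, 3, 10, 0), False, None),
    Trigger(datetime(2024, 4, 20, 10, 0), True, 30.0),  # older than 7 days, ignored
]
stats = card_stats(triggers, now)
```

With this sample data the rule triggered three times in the window, two of the three recoveries succeeded, and the mean recovery time is 90 seconds.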
If no rule matches, the failure is still recorded on the Events page; it simply does not trigger automatic recovery.
You do not need a rule to retry something. From an incident (or an event) you can trigger a manual rerun. Nimbus queues it exactly like an automatic one: it waits for the original run to settle, builds a trimmed automation, fires it, and reports the result.
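The rerun sequence can be sketched as four steps in order. This is a conceptual illustration, not Nimbus code: `manual_rerun`, `is_settled`, and `fire` are hypothetical names, and the sketch assumes that "trimmed" means keeping the failed step and everything after it, which this page does not specify.

```python
import time
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class RerunResult:
    automation: str
    steps: List[str]   # the steps that were actually rerun
    succeeded: bool


def manual_rerun(
    automation: str,
    steps: List[str],
    failed_step: str,
    is_settled: Callable[[], bool],     # hypothetical status check
    fire: Callable[[List[str]], bool],  # hypothetical launcher, returns success
    poll_seconds: float = 1.0,
    max_polls: int = 60,
) -> RerunResult:
    # 1. Wait for the original run to settle before starting a new one.
    for _ in range(max_polls):
        if is_settled():
            break
        time.sleep(poll_seconds)
    else:
        raise TimeoutError(f"{automation}: original run did not settle")

    # 2. Build the trimmed automation.
    #    ASSUMPTION: keep the failed step and every step after it.
    trimmed = steps[steps.index(failed_step):]

    # 3. Fire it, then 4. report the result.
    succeeded = fire(trimmed)
    return RerunResult(automation, trimmed, succeeded)


# Stubbed usage: the original run settles on the third status check.
polls = iter([False, False, True])
fired = []


def fake_fire(trimmed_steps):
    fired.append(trimmed_steps)
    return True


result = manual_rerun(
    "Nightly import",
    ["extract", "transform", "send"],
    failed_step="transform",
    is_settled=lambda: next(polls),
    fire=fake_fire,
    poll_seconds=0,
)
```

Waiting for the original run to settle first keeps two runs of the same automation from overlapping.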