Running Effective Post-Outage Retro Sessions for Infrastructure Teams
Designed for Senior Site Reliability Engineers and Infrastructure Team Leads who are responsible for facilitating post-outage retrospectives in high-stakes, production environments to spark real collaboration and high-energy learning.
A 90-minute hybrid workshop designed for technical leaders facing mounting pressure after high-impact outages. Teams are distributed, often juggling incident fatigue, and wary of 'retro' becoming a finger-pointing exercise rather than a learning moment. Session includes both in-person and remote participants using collaborative tools.
Outage Mystery Icebreaker
Kick off with a short, anonymized outage scenario that omits key facts. Present the group with only symptoms and ask: 'What are your first questions?' This primes curiosity while highlighting the complexity of post-mortem work.
Tap to view the full activity.
Why this works
Curiosity primes engagement and helps participants loosen up before diving into their own incidents. The 'missing details' prompt models how retros require inquiry, not immediate answers.
“Blame Game” Debunk
Present two contrasting statements: 'Outages are always someone's fault' vs. 'Most outages stem from system complexity.' Invite rapid polls or hands-up to choose which they agree with, then reveal research showing systemic causes dominate.
Tap to view the full activity.
Why this works
Quickly surfaces misconceptions and sets the tone for psychological safety—critical for productive retros.
Emoji Outage Check-in
Invite everyone to select an emoji (or physical card) representing how they felt during the last outage retro: 😬, 🤔, 😇, 😠, 😴. Share anonymously via chat or sticky notes. Facilitator reads a few aloud, normalizing diverse responses without pressure.
Tap to view the full activity.
Why this works
Low-stakes entry lets everyone express themselves, especially quieter voices. Emotional check-ins build trust and psychological safety.
Rapid Timeline Relay
Break into small groups (or virtual breakout rooms). Each group builds a quick timeline of an outage using sticky notes (physical or digital), passing the notes to the next person every 30 seconds, adding new details or questions. End with a fast-paced review.
Tap to view the full activity.
Why this works
High-energy collaboration turns a potentially dry task into a dynamic, kinetic mapping—activates all voices and uncovers fresh perspectives.
Choose Your Outage Adventure
Present a real-life dilemma: e.g., 'You discover the root cause is a misconfigured script by a junior engineer. Do you: A) Share the finding openly, B) Limit discussion, C) Focus only on technical fixes?' Let groups debate options, considering impact on learning and team morale.
Tap to view the full activity.
Why this works
Anchors theory in lived dilemmas—participants weigh tradeoffs and see how retro facilitation shapes culture.
Retro Commitment Canvas
As a closing activity, ask each participant to write one personal commitment (e.g., 'Next retro, I’ll ask more open-ended questions.') on a shared canvas or virtual board, then circle a peer’s commitment they admire. Volunteers read theirs aloud, building personal accountability.
Tap to view the full activity.
Why this works
Personal reflection and peer affirmation translate learning into action—closing the loop from theory to practice.
Sign up to unlock 3 more activities
Get the full pack, facilitation flow, and more ready-to-run ideas.