Problem Statement
What steps should a DevOps team take after a major production outage?
Explanation
First, restore service, either by rolling back to the last known-good release or by restoring from backup.
Once service is stable, conduct a post-mortem to identify the root cause, then update the runbooks and improve monitoring to prevent a recurrence.
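
The sequence above can be sketched as a small ordered checklist. This is only an illustration, not a prescribed tool: the step names and the `IncidentChecklist` class are hypothetical, and the strict ordering of the last two steps is an assumption made for simplicity.

```python
from dataclasses import dataclass, field
from typing import List, Optional

# Hypothetical step names; the order mirrors the explanation above.
# Placing runbooks before monitoring is an assumption of this sketch.
STEPS = [
    "restore_service",      # rollback or restore from backup
    "conduct_post_mortem",  # identify the root cause
    "update_runbooks",
    "improve_monitoring",
]


@dataclass
class IncidentChecklist:
    """Tracks which post-outage steps have been completed, in order."""

    completed: List[str] = field(default_factory=list)

    def next_step(self) -> Optional[str]:
        """Return the first step not yet completed, or None when all are done."""
        for step in STEPS:
            if step not in self.completed:
                return step
        return None

    def complete(self, step: str) -> None:
        """Mark a step done, refusing to skip ahead of the expected order."""
        expected = self.next_step()
        if step != expected:
            raise ValueError(f"expected {expected!r}, got {step!r}")
        self.completed.append(step)
```

For example, a fresh `IncidentChecklist()` reports `restore_service` as its next step, and calling `complete("improve_monitoring")` before the earlier steps raises a `ValueError`. Enforcing the order in code reflects the point of the explanation: recovery comes first, and the follow-up work builds on the post-mortem's findings.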