Control 4.9: Incident Reporting and Root Cause Analysis — Troubleshooting

Common issues and resolution steps for Copilot incident reporting and root cause analysis.

Common Issues

Symptoms: Known Copilot incidents occur without triggering configured alert policies.
Root Cause: Alert thresholds may be set too high, the alert policy may not be active, or the event type may not match the alert condition.
Resolution:
Verify alert policies are enabled in the Purview portal.
Review threshold settings and lower them if incidents are being missed.
Check that the alert conditions match the actual event record types in the audit log.
Test with a simulated event to confirm the alert pipeline is functional.

Symptoms: RCA reports lack sufficient detail to identify the true root cause or prevent recurrence.
Root Cause: Insufficient audit log data, lack of RCA training, or time pressure leading to surface-level analysis.
Resolution:
Ensure audit logging captures sufficient detail for Copilot interactions (Control 3.1).
Provide RCA methodology training (5 Whys, fishbone diagrams) to the incident response team.
Allow adequate time for thorough RCA rather than rushing to closure.
Implement a peer review process for RCA quality assurance.

Symptoms: Regulatory notification decisions are delayed, risking missed notification deadlines.
Root Cause: The assessment workflow lacks clear ownership, or the CCO is unavailable for timely approval.
Resolution:
Designate a deputy CCO for regulatory notification decisions when the CCO is unavailable.
Implement a time-bound escalation path (e.g., auto-escalate after 4 hours without a decision).
Pre-define notification criteria that can be assessed quickly based on incident category and impact.
Maintain a current contact list for regulatory notification recipients at each regulator.

Symptoms: Teams disagree on the severity level or category of a Copilot incident, delaying response.
Root Cause: Incident classification criteria are ambiguous or do not address Copilot-specific scenarios.
Resolution:
Review and update the incident classification criteria with Copilot-specific examples.
Establish a tie-breaking authority (e.g., CISO for security incidents, CCO for compliance incidents).
Create a decision matrix that maps incident characteristics to severity levels.
Conduct tabletop exercises to calibrate the team's classification consistency.

Check alert policy status: Navigate to Purview > Policies > Alert policies and verify status.
Review recent alerts: Check the Alerts dashboard for recent Copilot-related alerts.
Verify audit log coverage: Confirm Copilot events are present in the audit log.
Test notification delivery: Send a test alert to verify email notification delivery.

Severity	Condition	Escalation Path
Critical	Active Copilot data breach	CISO + CCO + Legal — immediate response
High	Regulatory notification deadline at risk	CCO + Legal — immediate assessment
Medium	Alert system failures	IT Security — investigate and restore
Low	RCA quality issues	Incident response team lead — process improvement