Control 2.4: Business Continuity and Disaster Recovery — Portal Walkthrough
Portal-based configuration guidance for Control 2.4: Business Continuity and Disaster Recovery.
Audience and Scope
This walkthrough is for M365 administrators in US financial services who configure Power Platform business continuity and disaster recovery (BC/DR) for Copilot Studio agents. It covers customer-managed steps in the Power Platform Admin Center (PPAC), Power Apps maker portal, Microsoft 365 admin center, and Microsoft Entra admin center as of April 2026. It does not cover Microsoft-side platform recovery, which is governed by the Online Services Terms and SLA.
Microsoft does not auto-failover Dataverse cross-region
Microsoft maintains in-region high availability for Dataverse and Copilot Studio. Cross-region recovery is the customer's responsibility and requires a pre-provisioned secondary-region environment, customer-managed solution and data exports, and a documented runbook. The steps below operationalize that responsibility.
Prerequisites
Before starting, confirm you have:
- Power Platform Admin role (canonical role from
docs/reference/role-catalog.md) — required for environment provisioning, manual backup, copy-environment operations, and tenant deployment settings - Environment Admin on the source production environment and the target DR environment — required to manage DLP exemptions, security roles, environment variables, and connection references
- Microsoft 365 Service Health Reader (or Global Reader) for at least one named admin account, plus an admin distribution list — required for Service Health and Message Center subscriptions
- Entra Agent ID Admin (where Entra Agent ID is enabled) — required to confirm agent identity continuity in the DR environment
- An approved Business Impact Analysis (BIA) listing the agents in scope, their criticality tier, and board-ratified RTO / RPO targets — see Control 2.4 §Zone-Specific Requirements
- An identified secondary Azure region in the same regulatory geography as your primary region (for example: East US ↔ West US, or North Europe ↔ West Europe). Confirm the secondary region against Azure regions for Power Platform
Overview
This walkthrough follows the customer-side BC/DR lifecycle for Copilot Studio agents:
| Part | Activity | Owner |
|---|---|---|
| 1 | Classify agents and ratify RTO / RPO in the BIA | AI Governance Lead |
| 2 | Provision and harden the secondary-region environment | Power Platform Admin |
| 3 | Confirm Microsoft system backups and configure manual backups | Power Platform Admin |
| 4 | Configure customer-managed solution and data exports | Power Platform Admin |
| 5 | Validate agent identity continuity in the DR environment | Entra Agent ID Admin |
| 6 | Subscribe to Service Health and Message Center alerts | Power Platform Admin |
| 7 | Author the DR runbook (declaration, failover, failback) | AI Governance Lead |
| 8 | Schedule and document recovery exercises | Compliance Officer |
Part 1 — Classify Agents and Ratify RTO / RPO
The BIA is the source of truth for which agents enter scope, what their tier is, and what their RTO / RPO targets are. The portal does not store this — it is a governance artifact maintained alongside the firm's BCP.
For each agent, capture in the BIA:
- Agent display name and Copilot Studio bot ID
- Hosting environment (production environment ID + region)
- Business owner and recovery owner
- Criticality tier (1 / 2 / 3 / 4) per Control 2.4 §Agent Criticality Classification
- RTO and RPO ratified by the BCP committee
- Dependencies: knowledge sources, custom connectors, environment variables, downstream APIs, identity (service principal / Entra Agent ID), Power Automate flows
- Recovery priority order within the tier
Cross-reference the agent inventory maintained under Control 3.1.
Part 2 — Provision and Harden the Secondary-Region Environment
Portal path (April 2026): Power Platform Admin Center → Resources → Environments → + New
2.1 Create the DR environment
- Open Power Platform Admin Center and sign in with a Power Platform Admin account
- Select Resources → Environments → + New
- Configure:
- Name:
<PrimaryEnvName>-DR - Region: the pre-approved secondary region for the primary's geography
- Type: Production (Tier 1 / Tier 2) or Sandbox (Tier 3 internal-only agents — note that sandbox retention is shorter)
- Add a Dataverse data store: Yes
- Currency / Language: match the primary environment
- Security group: apply the same restricted security group used in production
- Select Save and wait for provisioning to complete
2.2 Enable Managed Environments and apply parity policies
- Select the new DR environment → Settings → Product → Features
- Enable Managed Environment (required for any environment hosting Zone 2 or Zone 3 agents — see Control 2.1)
- Apply the same DLP policies that govern the primary environment: Policies → Data policies → assign the DR environment to the same policy scopes
- Reproduce in the DR environment:
- Security roles and team memberships
- Environment variables (values appropriate for the DR region — for example, regional API endpoints)
- Connection references (placeholder; will be re-bound during failover)
- Conditional Access targeting the DR environment URL
Document the region pairing in your BCP
Record the primary ↔ DR mapping in the BIA, not just inside PPAC. Examiners look for documented region pairing decisions, including the data-residency rationale.
Part 3 — Confirm Microsoft System Backups and Configure Manual Backups
Portal path (April 2026): Power Platform Admin Center → Resources → Environments → <env> → Settings → Updates → Backup & restore (or the Backups action on the environment command bar)
3.1 Verify system backups
- Open the production environment in PPAC
- From the command bar, select Backups
- Confirm:
- System backups are listed for the past 28 days (production) or 7 days (sandbox) — these are continuous Dataverse backups created automatically
- The most recent system backup timestamp is current
System backups support Microsoft-side recovery for in-region events. They restore the entire environment, not individual agents or solutions, and they are not portable cross-region.
3.2 Create a manual backup before any high-risk change
- From the Backups view select Create
- Enter a Label describing the change (for example,
Pre-deploy TradingAssistant 4.2.0) - Select Create
- Manual backups are retained per the published Power Platform retention policy — verify current retention against Backup and restore environments before relying on a specific window
3.3 Restore drill (sandbox only — never on production without a change ticket)
- Open a non-production environment in PPAC
- Select Backups → choose a system or manual backup → Restore
- Choose Restore over an existing environment (sandbox target) or Restore to a new environment
- Record the start and finish times in the exercise log — this measures Microsoft-side restore time as one input to RTO
Part 4 — Configure Customer-Managed Solution and Data Exports
System backups alone do not satisfy SEC 17a-4 reconstruction obligations and do not support cross-region recovery. Customer-managed exports are mandatory for Zone 2 and Zone 3.
4.1 Solution exports (managed)
Portal path: Power Apps maker portal → Solutions → select solution → Export solution
- Open Power Apps in the production environment
- Select Solutions → choose the solution containing the agent
- Select Export solution → Publish all customizations → Next
- Choose Managed → set Version following the Control 2.3 versioning convention
- Export and save the
.zipto immutable / WORM-capable storage (Azure Blob with immutability policy, or equivalent) - Repeat per the cadence required by the agent's RPO target
Solution export does not capture every Copilot Studio component
Knowledge sources, certain custom topics, and external data referenced by the agent may not be inside the solution package. Verify component coverage before relying on the export for full recovery. Maintain separate export procedures for excluded components and document them in the runbook.
4.2 Dataverse data exports
For agents that read or write Dataverse tables containing records subject to retention:
- Use Azure Synapse Link for Dataverse (PPAC → Environment → Azure Synapse Link) for continuous near-real-time replication to ADLS Gen2, or
- Use the Dataverse Web API / Dataflows for scheduled batch exports
Store outputs in immutable storage with a retention period meeting the longest of: SEC 17a-4 (six years for most records), FINRA 4511, and any applicable state regulation.
4.3 Pre-stage solutions in the DR environment
- In the DR environment, Solutions → Import solution
- Import the latest managed solution
- Bind connection references to DR-region connections (will be re-validated at failover)
- Verify environment variables resolve to DR-appropriate values
Refresh the DR environment on a schedule that keeps it within the RPO window — for Zone 3, this typically means daily at minimum and ideally automated via Power Platform pipelines (see Control 2.3).
Part 5 — Validate Agent Identity Continuity
Entra app registrations, service principals, federated credentials, and Entra Agent IDs are tenant-global — they are not bound to a Power Platform region and survive a single-region outage without re-registration. Validation focuses on confirming that the DR environment can authenticate using them.
Portal path: Microsoft Entra admin center → Identity → Applications → App registrations
- For each app registration or Agent ID used by an in-scope agent, confirm:
- The application is multi-tenant or single-tenant consistent with the primary
- Federated credentials (workload identity federation) are present and not expired
- Client secrets are stored in a Key Vault that is itself replicated cross-region (Azure Key Vault is region-paired by default)
- API permissions include any Graph or Dataverse scopes the agent uses
- In the DR Power Platform environment, register the application user (Power Platform Admin Center → DR environment → Settings → Users + permissions → Application users → + New app user)
- Assign the same Dataverse security role used in production
- Document the credential rotation owner and cadence in the BIA
Part 6 — Subscribe to Service Health and Message Center Alerts
Portal path: Microsoft 365 admin center → Health → Service health → Customize alerts
- Open Microsoft 365 admin center → Health → Service health
- Select Customize alerts → Add a new rule
- Subscribe a DR distribution list (not a single inbox) to alerts for:
- Power Platform
- Dynamics 365 (covers Dataverse advisories)
- Microsoft Copilot Studio
- Microsoft Entra
- Power Automate
- Repeat for Message Center under Health → Message center → Preferences → enable email digest for the same audiences
- Document the notification matrix in the runbook:
| Severity | Initial response | Escalation channel |
|---|---|---|
| Critical (active outage affecting Tier 1) | Immediate | DR bridge call + executive notification |
| High (degraded service for Tier 1 / Tier 2) | ≤ 15 minutes | Teams DR channel + email |
| Medium (advisory affecting future operations) | ≤ 1 hour | |
| Informational | Next business day | Digest |
Part 7 — Author the DR Runbook
The runbook is a controlled document maintained alongside the BCP. It should be version-controlled, reviewed at least annually, and approved by Compliance.
7.1 Declaration criteria
Document the conditions that authorize a DR declaration. Examples:
- Microsoft confirms a regional service incident affecting the primary Power Platform region with no ETA, or the incident exceeds the agent's RTO
- Tier 1 agent is unavailable for more than 30 minutes with no in-region resolution path
- Security incident requires isolation of the primary environment
7.2 Declaration authority
| Role | Authority |
|---|---|
| CIO or delegate | Tier 1 declaration |
| IT Operations Director | Tier 2 declaration |
| On-call IT Manager | After-hours declaration with mandatory CIO notification |
7.3 Tier 1 failover sequence (target ≤ 60 minutes)
- Declare (≤ 5 min): On-call manager confirms criteria, opens incident ticket, notifies leadership distribution list
- Validate DR readiness (≤ 10 min): Confirm DR environment status, latest solution version present, application users active
- Re-bind connections (≤ 15 min): Update connection references in DR to live credentials; refresh OAuth tokens for premium connectors
- Cut over traffic (≤ 10 min): Update Azure Front Door / Traffic Manager / Teams app endpoint / channel publish targets to point at the DR agent
- Smoke test (≤ 15 min): Execute the agent test script (see Verification & Testing) and validate critical flows
- Communicate (≤ 5 min): Notify business owners and users; update status page; log declaration time and recovery time
7.4 Failback sequence
- Confirm primary region is stable (Microsoft all-clear posted; no further service health advisories)
- Inventory data and configuration changes made in the DR environment during the outage
- Export modified solutions and any Dataverse data changes from DR
- Import to the primary environment and reconcile conflicts
- Cut traffic back to primary; place DR back into warm-standby state
- File post-event report (see Control 3.4)
Part 8 — Schedule and Document Recovery Exercises
Cadence: Annual for Zone 2; quarterly for Zone 3 (mix of tabletop and live failover).
8.1 Pre-exercise (≥ 2 weeks ahead)
- Schedule the exercise window with all participants and notify business owners
- Define the scenario (regional outage, tenant-wide auth failure, malicious deletion of an agent, etc.)
- Confirm the success criteria (RTO / RPO met, runbook accurate, no undocumented dependencies)
- Brief the DR team and assign an exercise observer
8.2 Exercise execution
- Follow the runbook exactly — observers note any deviation, ambiguity, or undocumented step
- Capture timestamps for each phase
- Document any data integrity issues found post-failover
8.3 Post-exercise documentation (retain ≥ 6 years)
- Exercise summary (scenario, scope, participants, observer)
- Measured RTO and RPO vs. target
- Gaps and corrective actions tracked to closure with owners and dates
- Compliance Officer sign-off
This evidence supports FFIEC examination, FINRA 4370 review, and OCC Heightened Standards (Appendix D) independent assurance.
Validation
After completing all parts, confirm:
- The BIA lists every Zone 2 and Zone 3 agent with tier, RTO, RPO, dependencies, and recovery owner
- A secondary-region environment exists for each Zone 3 production environment with parity policies and current solutions
- Manual and system backups are visible in PPAC for all in-scope production environments
- Customer-managed solution exports are landing in immutable storage on the cadence required by RPO
- Application users for agent identities are registered in the DR environment with the same Dataverse roles
- Service Health and Message Center alerts route to the DR distribution list, not a single mailbox
- The DR runbook is published, version-controlled, and approved by Compliance
- The most recent exercise has documented results and corrective actions
Back to Control 2.4 | PowerShell Setup | Verification & Testing | Troubleshooting
Updated: April 2026 | Version: v1.4.0