Control 2.9: Agent Performance Monitoring and Optimization
Control ID: 2.9 Pillar: Management Regulatory Reference: FINRA 4511, GLBA 501(b), SEC 17a-3/4, SOX 404 Last UI Verified: February 2026 Governance Levels: Baseline / Recommended / Regulated Last Verified: 2026-02-03
Objective
Establish comprehensive performance monitoring and optimization for AI agents to ensure reliable operation, service level compliance, and quality management. Financial regulators require demonstration that automated systems perform reliably and do not degrade over time.
Why This Matters for FSI
- FINRA 4511: Performance records support books and records requirements for automated systems
- GLBA 501(b): Monitoring protects customer information processing quality
- SEC 17a-3/4: Agent operation records preserved for regulatory examination
- SOX 404: Control effectiveness monitoring provides internal control evidence
Control Description
This control establishes performance monitoring through:
- KPI Definition - Define metrics (response time, error rate, resolution rate, CSAT, containment rate)
- Platform Analytics - Enable Power Platform and Copilot Studio analytics
- Custom Dashboards - Create Power BI dashboards for executive reporting
- Alerting Configuration - Configure threshold-based alerts for degradation
- Anomaly Detection - Implement AI-powered anomaly detection for Zone 3 agents
- Review Cadence - Establish weekly/monthly/quarterly performance review processes
Built-In vs Custom Monitoring Capabilities
| Capability | Platform Availability | Implementation |
|---|---|---|
| Response time metrics | Built-in (Copilot Studio Analytics) | Native configuration |
| Error rates | Built-in | Native configuration |
| CSAT scores | Built-in | Native configuration |
| Answer quality scores | Built-in | Native configuration |
| Session/conversation counts | Built-in | Native configuration |
| RAI-specific telemetry | Not built-in | Custom implementation via Application Insights |
| Hallucination tracking | Not built-in | Custom implementation using Azure AI Evaluation SDK |
| Grounding accuracy metrics | Not built-in | Custom implementation required |
Custom Implementation Required for RAI Monitoring
Copilot Studio provides operational metrics (response time, errors, CSAT) but does not include built-in responsible AI (RAI) telemetry such as hallucination detection, grounding accuracy measurement, or content safety violation tracking. Organizations requiring these capabilities must implement custom solutions using Azure AI Evaluation SDK, Application Insights custom events, or third-party RAI monitoring tools.
Key Configuration Points
Built-In Analytics (Native)
- Define performance KPIs per governance zone (response time, error rate, CSAT targets)
- Enable analytics in Power Platform Admin Center → Analytics → Copilot Studio
- Configure data export to Azure Data Lake for advanced analytics
- Create Power BI dashboards with KPI cards, trend analysis, and SLA compliance
- Set up Power Automate alerts for error rate > threshold (5%/2%/1% by zone)
- Enable Azure Monitor smart detection for Zone 3 agents
- Establish performance review meetings (weekly ops, monthly business, quarterly executive)
- Enable Hybrid Analytics — unified analytics view combining Copilot Studio and Microsoft Foundry agent telemetry into a single monitoring pane
- Configure Autonomous Agent Analytics — specialized dashboards for monitoring autonomous (unattended) agent performance, including trigger success rates, action completion, and escalation frequency
- Leverage ROI / Time & Cost Savings Analytics — built-in analytics that quantify agent business value through time saved, cost reduction, and operational efficiency metrics for executive reporting
RAI Telemetry (Custom Implementation Required)
- Implement Application Insights custom events for content safety violations
- Deploy Azure AI Evaluation SDK for hallucination detection in test pipelines
- Create custom grounding accuracy metrics using citation validation logic
- Build Power Automate flows to aggregate RAI events into dashboards
- Configure alerting on RAI metric thresholds (e.g., hallucination rate > 5%)
Implementation Guidance
See Verification & Testing playbook for RAI telemetry implementation patterns.
Zone-Specific Requirements
| Zone | Requirement | Implementation | Rationale |
|---|---|---|---|
| Zone 1 (Personal) | Basic analytics (built-in); monthly review; error rate alerting only | Native configuration | Low risk, minimal monitoring needed |
| Zone 2 (Team) | Standard dashboards (built-in); weekly review; error rate and response time alerts | Native configuration | Shared agents need quality monitoring |
| Zone 3 (Enterprise) | Full monitoring including RAI telemetry; daily + real-time review; all metrics including hallucination tracking; 24/7 on-call | Native + custom implementation | Customer-facing requires maximum visibility including RAI compliance |
Zone 3 RAI Monitoring
Zone 3 agents in customer-facing roles should implement custom RAI telemetry to track hallucination rates, grounding accuracy, and content safety events. This requires custom development using Azure AI Evaluation SDK or equivalent tooling.
Roles & Responsibilities
| Role | Responsibility |
|---|---|
| Power Platform Admin | Configure analytics, enable data export, manage dashboards |
| AI Governance Lead | Define KPIs, oversee review cadence, trend analysis |
| Operations Team | Monitor alerts, respond to degradation, execute optimization |
| Agent Owner | Review agent-specific performance, implement improvements |
Related Controls
| Control | Relationship |
|---|---|
| 3.1 - Agent Inventory | Performance linked to inventory metadata |
| 3.2 - Usage Analytics | Usage metrics inform performance baselines |
| 2.6 - Model Risk Management | Performance monitoring supports MRM validation |
| 3.10 - Hallucination Feedback | Hallucination patterns inform model degradation (Hallucination Tracker) |
| 1.7 - Audit Logging | Performance events captured in audit logs |
Implementation Playbooks
Step-by-Step Implementation
This control has detailed playbooks for implementation, automation, testing, and troubleshooting:
- Portal Walkthrough — Step-by-step portal configuration
- PowerShell Setup — Automation scripts
- Verification & Testing — Test cases and evidence collection
- Troubleshooting — Common issues and resolutions
Agent Usage & Performance Workbook
The Agent Usage & Performance Workbook provides pre-built performance monitoring dashboards including response latency percentiles (p50/p95/p99), error rates by type, and operational health indicators — deployable via Azure Monitor with RBAC-scoped access. See the Deployment Guide for setup instructions.
Verification Criteria
Confirm control effectiveness by verifying:
Built-In Analytics (All Zones)
- Analytics data flows to Power Platform Admin Center reports
- Power BI dashboard displays current metrics with working drill-down
- Test alert triggers notification when threshold temporarily lowered
- Performance review meetings scheduled with documented actions
- Usage insights digest arrives at configured recipient addresses
RAI Telemetry (Zone 3, Custom Implementation)
- Custom Application Insights events capture content safety violations (if implemented)
- Hallucination detection pipeline produces accuracy reports (if implemented)
- Grounding accuracy metrics visible in custom dashboard (if implemented)
Verification Scope
Items 6-8 apply only to organizations that have implemented custom RAI telemetry. These are not built-in platform capabilities.
Additional Resources
- Microsoft Learn: Copilot Studio Analytics Overview
- Microsoft Learn: Power Platform Analytics
- Microsoft Learn: Export Analytics to Azure
- Microsoft Learn: Azure Monitor Alerts
Updated: February 2026 | Version: v1.2 | UI Verification Status: Current