Control 2.9: Agent Performance Monitoring and Optimization

Control ID: 2.9 Pillar: Management Regulatory Reference: FINRA 4511, GLBA 501(b), SEC 17a-3/4, SOX 404 Last UI Verified: February 2026 Governance Levels: Baseline / Recommended / Regulated Last Verified: 2026-02-03

Objective

Establish comprehensive performance monitoring and optimization for AI agents to ensure reliable operation, service level compliance, and quality management. Financial regulators require demonstration that automated systems perform reliably and do not degrade over time.

Why This Matters for FSI

FINRA 4511: Performance records support books and records requirements for automated systems
GLBA 501(b): Monitoring protects customer information processing quality
SEC 17a-3/4: Agent operation records preserved for regulatory examination
SOX 404: Control effectiveness monitoring provides internal control evidence

Control Description

This control establishes performance monitoring through:

KPI Definition - Define metrics (response time, error rate, resolution rate, CSAT, containment rate)
Platform Analytics - Enable Power Platform and Copilot Studio analytics
Custom Dashboards - Create Power BI dashboards for executive reporting
Alerting Configuration - Configure threshold-based alerts for degradation
Anomaly Detection - Implement AI-powered anomaly detection for Zone 3 agents
Review Cadence - Establish weekly/monthly/quarterly performance review processes

Built-In vs Custom Monitoring Capabilities

Capability	Platform Availability	Implementation
Response time metrics	Built-in (Copilot Studio Analytics)	Native configuration
Error rates	Built-in	Native configuration
CSAT scores	Built-in	Native configuration
Answer quality scores	Built-in	Native configuration
Session/conversation counts	Built-in	Native configuration
RAI-specific telemetry	Not built-in	Custom implementation via Application Insights
Hallucination tracking	Not built-in	Custom implementation using Azure AI Evaluation SDK
Grounding accuracy metrics	Not built-in	Custom implementation required

Custom Implementation Required for RAI Monitoring

Copilot Studio provides operational metrics (response time, errors, CSAT) but does not include built-in responsible AI (RAI) telemetry such as hallucination detection, grounding accuracy measurement, or content safety violation tracking. Organizations requiring these capabilities must implement custom solutions using Azure AI Evaluation SDK, Application Insights custom events, or third-party RAI monitoring tools.

Key Configuration Points

Built-In Analytics (Native)

Define performance KPIs per governance zone (response time, error rate, CSAT targets)
Enable analytics in Power Platform Admin Center → Analytics → Copilot Studio
Configure data export to Azure Data Lake for advanced analytics
Create Power BI dashboards with KPI cards, trend analysis, and SLA compliance
Set up Power Automate alerts for error rate > threshold (5%/2%/1% by zone)
Enable Azure Monitor smart detection for Zone 3 agents
Establish performance review meetings (weekly ops, monthly business, quarterly executive)
Enable Hybrid Analytics — unified analytics view combining Copilot Studio and Microsoft Foundry agent telemetry into a single monitoring pane
Configure Autonomous Agent Analytics — specialized dashboards for monitoring autonomous (unattended) agent performance, including trigger success rates, action completion, and escalation frequency
Leverage ROI / Time & Cost Savings Analytics — built-in analytics that quantify agent business value through time saved, cost reduction, and operational efficiency metrics for executive reporting

RAI Telemetry (Custom Implementation Required)

Implement Application Insights custom events for content safety violations
Deploy Azure AI Evaluation SDK for hallucination detection in test pipelines
Create custom grounding accuracy metrics using citation validation logic
Build Power Automate flows to aggregate RAI events into dashboards
Configure alerting on RAI metric thresholds (e.g., hallucination rate > 5%)

Implementation Guidance

See Verification & Testing playbook for RAI telemetry implementation patterns.

Zone-Specific Requirements

Zone	Requirement	Implementation	Rationale
Zone 1 (Personal)	Basic analytics (built-in); monthly review; error rate alerting only	Native configuration	Low risk, minimal monitoring needed
Zone 2 (Team)	Standard dashboards (built-in); weekly review; error rate and response time alerts	Native configuration	Shared agents need quality monitoring
Zone 3 (Enterprise)	Full monitoring including RAI telemetry; daily + real-time review; all metrics including hallucination tracking; 24/7 on-call	Native + custom implementation	Customer-facing requires maximum visibility including RAI compliance

Zone 3 RAI Monitoring

Zone 3 agents in customer-facing roles should implement custom RAI telemetry to track hallucination rates, grounding accuracy, and content safety events. This requires custom development using Azure AI Evaluation SDK or equivalent tooling.

Roles & Responsibilities

Role	Responsibility
Power Platform Admin	Configure analytics, enable data export, manage dashboards
AI Governance Lead	Define KPIs, oversee review cadence, trend analysis
Operations Team	Monitor alerts, respond to degradation, execute optimization
Agent Owner	Review agent-specific performance, implement improvements

Control	Relationship
3.1 - Agent Inventory	Performance linked to inventory metadata
3.2 - Usage Analytics	Usage metrics inform performance baselines
2.6 - Model Risk Management	Performance monitoring supports MRM validation
3.10 - Hallucination Feedback	Hallucination patterns inform model degradation (Hallucination Tracker)
1.7 - Audit Logging	Performance events captured in audit logs

Implementation Playbooks

Step-by-Step Implementation

This control has detailed playbooks for implementation, automation, testing, and troubleshooting:

Portal Walkthrough — Step-by-step portal configuration
PowerShell Setup — Automation scripts
Verification & Testing — Test cases and evidence collection
Troubleshooting — Common issues and resolutions

Agent Usage & Performance Workbook

The Agent Usage & Performance Workbook provides pre-built performance monitoring dashboards including response latency percentiles (p50/p95/p99), error rates by type, and operational health indicators — deployable via Azure Monitor with RBAC-scoped access. See the Deployment Guide for setup instructions.

Verification Criteria

Confirm control effectiveness by verifying:

Built-In Analytics (All Zones)

Analytics data flows to Power Platform Admin Center reports
Power BI dashboard displays current metrics with working drill-down
Test alert triggers notification when threshold temporarily lowered
Performance review meetings scheduled with documented actions
Usage insights digest arrives at configured recipient addresses

RAI Telemetry (Zone 3, Custom Implementation)

Custom Application Insights events capture content safety violations (if implemented)
Hallucination detection pipeline produces accuracy reports (if implemented)
Grounding accuracy metrics visible in custom dashboard (if implemented)

Verification Scope

Items 6-8 apply only to organizations that have implemented custom RAI telemetry. These are not built-in platform capabilities.

Additional Resources

Updated: February 2026 | Version: v1.2 | UI Verification Status: Current