Troubleshooting: Control 2.17 - Multi-Agent Orchestration Limits
Last Updated: January 2026
Common Issues
| Issue | Cause | Resolution |
|---|---|---|
| Depth limit not enforcing | Tracking variable not implemented | Verify depth tracking in agent topics |
| Circuit breaker stuck open | Reset timeout too long or failures continue | Check downstream agent health; adjust reset timeout |
| Cascade failures occurring | Circuit breaker not configured | Implement circuit breaker pattern |
| HITL (human-in-the-loop) timeouts causing abandonment | Timeout too short or approvers unavailable | Adjust timeout; ensure approver coverage |
| Orchestration too slow | Multiple sequential calls | Consider parallel calls where safe |
Detailed Troubleshooting
Issue: Delegation Depth Limit Not Enforcing
Symptoms: Agents can chain beyond configured depth limits
Diagnostic Steps:
- Verify depth tracking variable exists:
  - Copilot Studio > Agent > Topics > Check for orchestration_depth variable
- Check depth increment logic:
  - Variable should increment before each delegation
  - Should be passed to delegated agent
- Verify depth check condition:
  - Condition should compare against max_depth
  - Should block or error if exceeded
Resolution:
- Implement depth tracking if missing
- Fix increment logic (ensure it increments BEFORE delegation)
- Add proper condition check before delegation calls
- Consider using Power Automate for complex orchestration with better control
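The increment-then-check order described above can be sketched in Python. This is only an illustration of the logic, not Copilot Studio or Power Automate syntax: `DelegationContext`, `delegate`, `DepthLimitExceeded`, and the limit of 3 are hypothetical, and only the names `orchestration_depth` and `max_depth` come from this page.

```python
from dataclasses import dataclass

MAX_DEPTH = 3  # assumed limit for illustration; use the value from your policy


class DepthLimitExceeded(Exception):
    """Raised when a delegation would exceed the configured depth."""


@dataclass(frozen=True)
class DelegationContext:
    # Mirrors the orchestration_depth variable tracked in agent topics.
    orchestration_depth: int = 0


def delegate(context: DelegationContext, max_depth: int = MAX_DEPTH) -> DelegationContext:
    """Return the context to pass to the delegated agent, or raise if blocked."""
    # Increment BEFORE delegating so the check sees the depth the callee would run at.
    next_depth = context.orchestration_depth + 1
    if next_depth > max_depth:
        raise DepthLimitExceeded(
            f"delegation blocked: depth {next_depth} exceeds max_depth {max_depth}"
        )
    # The incremented value travels with the call to the delegated agent.
    return DelegationContext(orchestration_depth=next_depth)
```

If the increment happened after the call instead, the delegated agent would receive the caller's depth and the chain could run one level deeper than intended at every hop.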
Issue: Circuit Breaker Stuck Open
Symptoms: Delegated agent calls stay blocked even after the underlying issue is resolved
Diagnostic Steps:
- Check circuit breaker state in your monitoring system
- Verify the downstream agent is actually healthy:
  - Test direct calls to the agent
  - Check agent health metrics
- Check reset timeout configuration:
  - Timeout may be longer than expected
  - Half-open test may be failing
Resolution:
- Manually reset circuit breaker if available
- Fix downstream agent issues
- Adjust reset timeout to appropriate duration
- Verify half-open test is configured correctly
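To reason about why a breaker stays open, it can help to have the state machine in front of you. The following is a minimal sketch of a generic circuit breaker, assuming a simple consecutive-failure threshold; the class, its parameters, and the default values are hypothetical and do not correspond to any built-in Copilot Studio feature.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


class CircuitOpenError(Exception):
    """Raised when a call is blocked because the circuit is open."""


class CircuitBreaker:
    def __init__(self, failure_threshold: int = 3, reset_timeout: float = 30.0,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.failure_threshold = failure_threshold
        self.reset_timeout = reset_timeout  # seconds to wait before a half-open test
        self._clock = clock
        self._failures = 0
        self._opened_at: Optional[float] = None  # None means the circuit is closed

    @property
    def state(self) -> str:
        if self._opened_at is None:
            return "closed"
        if self._clock() - self._opened_at >= self.reset_timeout:
            return "half-open"
        return "open"

    def call(self, func: Callable[[], T]) -> T:
        state = self.state
        if state == "open":
            raise CircuitOpenError("circuit is open; call blocked")
        try:
            result = func()
        except Exception:
            self._failures += 1
            # A failed half-open test reopens the circuit and restarts the timer.
            if state == "half-open" or self._failures >= self.failure_threshold:
                self._opened_at = self._clock()
            raise
        # Any success closes the circuit and clears the failure count.
        self._failures = 0
        self._opened_at = None
        return result

    def reset(self) -> None:
        """Manual reset, for use after the downstream agent is confirmed healthy."""
        self._failures = 0
        self._opened_at = None
```

In this sketch a breaker looks "stuck" in two ways that match the diagnostics above: the reset timeout has not elapsed yet, or every half-open test call fails and restarts the timer.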
Issue: Cascade Failures in Orchestration
Symptoms: One agent failure causes entire chain to fail
Diagnostic Steps:
- Check if circuit breakers are implemented
- Review error handling in orchestrating agents:
  - Are failures being caught?
  - Is there fallback behavior?
- Check timeout configuration:
  - Each agent call's timeout should be shorter than the total time allowed for the request
  - The combined timeouts along a chain should not exceed that total
Resolution:
- Implement circuit breakers on all agent-to-agent calls
- Add proper error handling with fallbacks
- Configure appropriate timeouts at each level
- Consider bulkhead pattern for isolation
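The timeout and fallback checks above can be expressed as a small sketch. It assumes the calls in a chain run one after another, so their timeouts add up; `check_timeout_budget` and `call_with_fallback` are hypothetical helper names, not part of any platform API.

```python
from typing import Callable, List, Sequence, TypeVar

T = TypeVar("T")


def check_timeout_budget(step_timeouts: Sequence[float], total_budget: float) -> List[str]:
    """Return a list of problems; an empty list means the configuration is consistent."""
    problems: List[str] = []
    for index, timeout in enumerate(step_timeouts):
        # Each step must leave room inside the overall budget.
        if timeout >= total_budget:
            problems.append(
                f"step {index}: timeout {timeout}s is not shorter than total {total_budget}s"
            )
    combined = sum(step_timeouts)
    # Assumption: steps run sequentially, so worst-case time is the sum.
    if combined > total_budget:
        problems.append(f"combined timeouts {combined}s exceed total {total_budget}s")
    return problems


def call_with_fallback(primary: Callable[[], T], fallback: Callable[[], T]) -> T:
    """Run the primary call; on any failure, return the fallback result instead."""
    try:
        return primary()
    except Exception:
        return fallback()
```

For example, `check_timeout_budget([20, 20], 30)` reports one problem because the two steps together could take 40 seconds, even though each one is individually shorter than the 30-second total.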
Issue: HITL Causing User Abandonment
Symptoms: Users leave before HITL approval completes
Diagnostic Steps:
- Check HITL timeout configuration:
  - Is timeout appropriate for the approval process?
  - Are approvers available during business hours?
- Review approval routing:
  - Are requests going to available approvers?
  - Is there a backup approver chain?
- Check user communication:
  - Are users informed of the wait?
  - Is there a way to check status?
Resolution:
- Adjust timeout based on actual approval times
- Implement backup approver chain
- Provide status updates to waiting users
- Consider async patterns where appropriate
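Two of these resolutions lend themselves to a short sketch: deriving a timeout from observed approval times, and walking a backup approver chain. The helper names, the nearest-rank percentile method, and the 95th-percentile default are illustrative assumptions; the durations would come from your own approval logs.

```python
import math
from typing import Callable, Optional, Sequence


def suggest_timeout(durations_seconds: Sequence[float], percentile: float = 95.0) -> float:
    """Nearest-rank percentile of observed approval durations, in seconds."""
    if not durations_seconds:
        raise ValueError("need at least one observed approval duration")
    ordered = sorted(durations_seconds)
    rank = math.ceil(percentile * len(ordered) / 100)
    return ordered[max(rank, 1) - 1]


def pick_approver(chain: Sequence[str],
                  is_available: Callable[[str], bool]) -> Optional[str]:
    """Return the first available approver in the chain, or None if nobody is."""
    for approver in chain:
        if is_available(approver):
            return approver
    return None
```

A `None` result from `pick_approver` is the case worth handling explicitly, for example by telling the user the request is queued rather than leaving them waiting without feedback.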
How to Confirm Configuration is Active
Depth Limiting
- Create a test scenario that would exceed depth
- Verify the request is blocked
- Check logs for depth violation event
Circuit Breaker
- Simulate failures in a test environment
- Verify circuit opens after threshold
- Wait for reset and verify it closes
HITL Checkpoints
- Trigger a HITL-required operation
- Verify approval request is generated
- Test approval and denial paths
Escalation Path
If issues persist after troubleshooting:
- Copilot Studio Admin - Agent configuration issues
- Power Automate Admin - Flow-based orchestration issues
- AI Governance Lead - Policy and limit questions
- Microsoft Support - Platform limitations
Known Limitations
| Limitation | Impact | Workaround |
|---|---|---|
| No native depth tracking | Must implement manually | Use conversation variables; document pattern |
| Circuit breaker not built-in | Requires custom implementation | Use Power Automate with error handling |
| Limited visibility into chains | Hard to trace multi-agent flows | Implement correlation IDs; use Application Insights |
| HITL requires polling | No native webhook support | Use Power Automate adaptive cards |
| Cross-environment orchestration complex | Multi-env chains hard to manage | Keep orchestration within single environment |
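As an illustration of the correlation ID workaround in the table above, the sketch below generates one ID at the start of a chain and carries it, together with the depth, in every delegation payload. The payload shape and function names are hypothetical; how the fields reach Application Insights depends on your logging setup and is not shown.

```python
import logging
import uuid
from typing import Any, Dict, Optional

logger = logging.getLogger("orchestration")


def new_correlation_id() -> str:
    """Create an ID once, at the entry point of an orchestration chain."""
    return uuid.uuid4().hex


def build_delegation_payload(message: str,
                             correlation_id: Optional[str] = None,
                             depth: int = 0) -> Dict[str, Any]:
    """Attach the chain's correlation ID and depth to an outgoing delegation."""
    cid = correlation_id or new_correlation_id()
    # Log the same fields on every hop so the chain can be reassembled later.
    logger.info("delegating", extra={"correlation_id": cid, "orchestration_depth": depth})
    return {"correlation_id": cid, "orchestration_depth": depth, "message": message}
```

Each delegated agent reuses the `correlation_id` it received instead of creating a new one, so a single query on that value returns every hop in the chain.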