Troubleshooting: Control 2.18 - Automated Conflict of Interest Testing
Last Updated: January 2026
Common Issues
| Issue | Cause | Resolution |
|---|---|---|
| Tests not running | Schedule misconfigured or auth failure | Check schedule; verify API credentials |
| High false positive rate | Test criteria too strict | Tune evaluation criteria; add context |
| Agent responses inconsistent | Non-deterministic generation | Run multiple iterations; score on the aggregate pass rate |
| Test coverage gaps | Missing scenarios | Review with compliance; add test cases |
| Results not retained | Storage configuration error | Check storage permissions and paths |
Detailed Troubleshooting
Issue: Tests Not Executing on Schedule
Symptoms: Automated tests don't run at scheduled times
Diagnostic Steps:
- Check automation schedule:
  - Power Automate: Flow run history
  - Custom: Task scheduler logs
- Verify API authentication:
  - Check credentials are valid
  - Verify API endpoint is accessible
- Check for execution errors in logs
Resolution:
- Correct schedule configuration
- Update expired credentials
- Fix connectivity issues
- Re-enable disabled automations
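As a rough illustration of the schedule check, the sketch below scans a list of run timestamps for gaps longer than the expected interval. The function name `find_missed_runs`, the five-minute tolerance, and the sample timestamps are assumptions made for illustration; none of them come from Power Automate or any scheduler API, so adapt the input to however your run history is exported.

```python
from datetime import datetime, timedelta


def find_missed_runs(run_times, interval, now, tolerance=timedelta(minutes=5)):
    """Return (gap_start, gap_end) pairs where consecutive runs are further
    apart than the expected interval plus a tolerance.

    `now` is appended to the history so a stalled schedule is also reported.
    """
    points = sorted(run_times) + [now]
    gaps = []
    for earlier, later in zip(points, points[1:]):
        if later - earlier > interval + tolerance:
            gaps.append((earlier, later))
    return gaps


# Hypothetical run history for a daily schedule (illustrative values only).
history = [
    datetime(2026, 1, 5, 2, 0),
    datetime(2026, 1, 6, 2, 0),
    datetime(2026, 1, 9, 2, 0),
]
gaps = find_missed_runs(history, timedelta(days=1), now=datetime(2026, 1, 9, 12, 0))
for start, end in gaps:
    print(f"No run between {start} and {end}")
```

Any gap reported here is a starting point for checking the logs around that window; it does not by itself say whether the cause was the schedule, credentials, or connectivity.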
Issue: High False Positive Rate
Symptoms: Tests fail but agent behavior appears acceptable on manual review
Diagnostic Steps:
- Review failure details:
  - What criteria triggered failure?
  - Is the agent response actually problematic?
- Check test criteria:
  - Are patterns too strict?
  - Is context being considered?
- Review sample failures with compliance team
Resolution:
- Adjust evaluation criteria
- Add more context to test prompts
- Use semantic analysis instead of keyword matching
- Implement confidence thresholds
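One way to picture confidence thresholds is a three-way outcome: fail only on high-confidence findings and route borderline ones to manual review. The sketch below is a minimal example under that assumption. The `Finding` type, the rule name, and the 0.8 and 0.5 cut-offs are hypothetical; the confidence score itself would come from whatever evaluator you already use.

```python
from dataclasses import dataclass


@dataclass
class Finding:
    rule: str          # hypothetical rule identifier
    confidence: float  # 0.0-1.0 score produced by your evaluator


def classify(findings, fail_at=0.8, review_at=0.5):
    """Return 'fail', 'review', or 'pass' based on the strongest finding."""
    top = max((f.confidence for f in findings), default=0.0)
    if top >= fail_at:
        return "fail"
    if top >= review_at:
        return "review"
    return "pass"


# Illustrative inputs; the thresholds should be tuned with compliance.
borderline = classify([Finding("product_steering", 0.62)])
clear_hit = classify([Finding("product_steering", 0.91)])
clean = classify([])
print(borderline, clear_hit, clean)
```

Sending borderline results to review rather than failing them outright is what reduces false positives here; the trade-off is more manual work, so the `review_at` value is worth revisiting as the criteria mature.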
Issue: Inconsistent Agent Responses
Symptoms: Same test sometimes passes, sometimes fails
Diagnostic Steps:
- Understand agent response variability:
  - LLM responses have inherent variability
  - Same prompt may yield different wording
- Check if meaning is consistent even if wording varies
- Review test evaluation logic
Resolution:
- Run tests multiple times and use aggregate results
- Use semantic similarity instead of exact matching
- Set a pass threshold (e.g., 8 of 10 runs pass)
- Focus on meaning, not specific wording
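The aggregate approach with a pass threshold can be sketched as follows. This is an illustrative helper, not part of any test framework: `aggregate_verdict` and its parameters are assumed names, and `run_test` stands in for whatever callable sends the prompt to the agent and returns `True` when a single run passes evaluation.

```python
def aggregate_verdict(run_test, runs=10, min_passes=8):
    """Run the same test `runs` times and pass only if at least
    `min_passes` individual runs pass."""
    passes = sum(1 for _ in range(runs) if run_test())
    return {"passes": passes, "runs": runs, "passed": passes >= min_passes}


# Stand-in for a real agent call: a fixed sequence of per-run outcomes.
outcomes = iter([True, True, False, True, True, True, True, False, True, True])
result = aggregate_verdict(lambda: next(outcomes))
print(result)
```

Recording the raw pass count alongside the verdict makes it easier to spot tests that sit right at the threshold and may need their evaluation logic reviewed.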
Issue: Coverage Gaps Identified
Symptoms: Compliance identifies COI scenarios not covered by tests
Diagnostic Steps:
- Review current test inventory against regulatory requirements
- Identify specific missing scenarios
- Assess risk of gaps
Resolution:
- Add test cases for missing scenarios
- Prioritize based on risk
- Review test coverage quarterly with compliance
- Document coverage rationale
How to Confirm Configuration is Active
Test Automation
- Check recent execution history
- Verify results are being captured
- Confirm alerts are configured
Test Coverage
- Review test case inventory
- Map to COI categories
- Verify all categories have coverage
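The mapping step above can be sketched as a simple inventory check. The category names, test IDs, and the `coverage_by_category` helper below are hypothetical placeholders; the real category list should come from your compliance team, not from this example.

```python
from collections import defaultdict

# Placeholder COI categories (assumed for illustration only).
REQUIRED_CATEGORIES = {"product_steering", "commission_incentive", "cross_selling"}

# Placeholder test inventory; in practice this would be loaded from your test suite.
test_inventory = [
    {"id": "COI-001", "category": "product_steering"},
    {"id": "COI-002", "category": "product_steering"},
    {"id": "COI-003", "category": "commission_incentive"},
]


def coverage_by_category(inventory, required):
    """Group test IDs by category and list required categories with no tests."""
    covered = defaultdict(list)
    for case in inventory:
        covered[case["category"]].append(case["id"])
    missing = sorted(required - set(covered))
    return dict(covered), missing


covered, missing = coverage_by_category(test_inventory, REQUIRED_CATEGORIES)
print("Covered:", covered)
print("Missing:", missing)
```

A check like this only confirms that each category has at least one test; whether those tests are adequate for the scenario still needs review with compliance.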
Compliance Reporting
- Generate a test report
- Verify data is current
- Confirm report reaches compliance team
Escalation Path
If issues persist after troubleshooting:
- QA/Test Team - Test framework issues
- AI/ML Team - Agent behavior analysis
- Compliance - Test criteria and coverage
- AI Governance Lead - Policy questions
Known Limitations
| Limitation | Impact | Workaround |
|---|---|---|
| LLM response variability | Test results may vary | Use aggregate scoring |
| No built-in COI detection | Requires custom implementation | Build evaluation criteria |
| Semantic analysis complexity | Hard to detect subtle bias | Combine automated and manual review |
| Test maintenance burden | Tests may become stale | Schedule quarterly test review |