Skip to content

FSI Agent Governance Framework

Troubleshooting

judeper/FSI-AgentGov

Troubleshooting: Control 2.18 - Automated Conflict of Interest Testing

Last Updated: January 2026

Common Issues

Issue	Cause	Resolution
Tests not running	Schedule misconfigured or auth failure	Check schedule; verify API credentials
High false positive rate	Test criteria too strict	Tune evaluation criteria; add context
Agent responses inconsistent	Non-deterministic generation	Run multiple iterations; use averages
Test coverage gaps	Missing scenarios	Review with compliance; add test cases
Results not retained	Storage configuration error	Check storage permissions and paths

Detailed Troubleshooting

Issue: Tests Not Executing on Schedule

Symptoms: Automated tests don't run at scheduled times

Diagnostic Steps:

Check automation schedule:
Power Automate: Flow run history
Custom: Task scheduler logs
Verify API authentication:
Check credentials are valid
Verify API endpoint is accessible
Check for execution errors in logs

Resolution:

Correct schedule configuration
Update expired credentials
Fix connectivity issues
Re-enable disabled automations

Issue: High False Positive Rate

Symptoms: Tests fail but agent behavior appears acceptable on manual review

Diagnostic Steps:

Review failure details:
What criteria triggered failure?
Is the agent response actually problematic?
Check test criteria:
Are patterns too strict?
Is context being considered?
Review sample failures with compliance team

Resolution:

Adjust evaluation criteria
Add more context to test prompts
Use semantic analysis vs. keyword matching
Implement confidence thresholds

Issue: Inconsistent Agent Responses

Symptoms: Same test sometimes passes, sometimes fails

Diagnostic Steps:

Understand agent response variability:
LLM responses have inherent variability
Same prompt may yield different wording
Check if meaning is consistent even if wording varies
Review test evaluation logic

Resolution:

Run tests multiple times and use aggregate results
Use semantic similarity vs. exact matching
Set pass threshold (e.g., 8/10 runs pass)
Focus on meaning, not specific wording

Issue: Coverage Gaps Identified

Symptoms: Compliance identifies COI scenarios not covered by tests

Diagnostic Steps:

Review current test inventory against regulatory requirements
Identify specific missing scenarios
Assess risk of gaps

Resolution:

Add test cases for missing scenarios
Prioritize based on risk
Review test coverage quarterly with compliance
Document coverage rationale

How to Confirm Configuration is Active

Test Automation

Check recent execution history
Verify results are being captured
Confirm alerts are configured

Test Coverage

Review test case inventory
Map to COI categories
Verify all categories have coverage

Compliance Reporting

Generate a test report
Verify data is current
Confirm report reaches compliance team

Escalation Path

If issues persist after troubleshooting:

QA/Test Team - Test framework issues
AI/ML Team - Agent behavior analysis
Compliance - Test criteria and coverage
AI Governance Lead - Policy questions

Known Limitations

Limitation	Impact	Workaround
LLM response variability	Test results may vary	Use aggregate scoring
No built-in COI detection	Requires custom implementation	Build evaluation criteria
Semantic analysis complexity	Hard to detect subtle bias	Combine automated and manual review
Test maintenance burden	Tests may become stale	Schedule quarterly test review

Back to Control 2.18 | Portal Walkthrough | PowerShell Setup | Verification Testing