Overview

The Policy Feedback Loop generates LLM-written feedback when a policy violation occurs, helping users understand restrictions and letting agents self-correct their outputs through retry loops. There are two approaches:
  • User Policy Feedback: provides helpful guidance when user input violates a policy
  • Agent Policy Feedback: lets the agent retry and fix violations in its own output

User Policy Feedback

Give users constructive guidance instead of hard blocking:
from upsonic import Agent, Task
from upsonic.safety_engine.policies.crypto_policies import CryptoBlockPolicy

agent = Agent(
    model="openai/gpt-4o-mini",
    user_policy=CryptoBlockPolicy,
    user_policy_feedback=True,
    user_policy_feedback_loop=1
)

task = Task(
    description="How can I buy Bitcoin and invest in cryptocurrency?"
)

result = agent.do(task)
print(result)
# Returns helpful feedback explaining the restriction and how to rephrase
Key Behavior:
  • No re-execution occurs
  • User receives guidance on how to rephrase
  • Task stops after returning feedback
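
For contrast, with user_policy_feedback left at its default of False, the same question is hard-blocked instead of answered with guidance. A minimal sketch (the exact block message comes from the policy's action):
from upsonic import Agent, Task
from upsonic.safety_engine.policies.crypto_policies import CryptoBlockPolicy

strict_agent = Agent(
    model="openai/gpt-4o-mini",
    user_policy=CryptoBlockPolicy,  # feedback disabled (default): hard block, no guidance
)

result = strict_agent.do(Task(description="How can I buy Bitcoin?"))
print(result)  # the policy's block output, not advice on how to rephrase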

Agent Policy Feedback

Enable agents to self-correct when their output violates policies:
from upsonic import Agent, Task
from upsonic.safety_engine.policies.pii_policies import PIIBlockPolicy

agent = Agent(
    model="openai/gpt-4o-mini",
    agent_policy=PIIBlockPolicy,
    agent_policy_feedback=True,
    agent_policy_feedback_loop=3  # Allow up to 3 retries
)

task = Task(
    description="Create a realistic customer profile with name Alice, email [email protected], phone number 1234567890, and address 123 Main St, Anytown, USA"
)

result = agent.do(task)
print(result)
# Agent retries up to 3 times until the output passes the policy
# The injected feedback pushes the model to mask the PII, e.g.:
#   "Name: [REDACTED], Email: [REDACTED], Phone: [REDACTED]"
# If the output still violates the policy after all retries, the block action
# applies, e.g.: "This content has been blocked as it contains personal
# identifiable information (PII) such as names, addresses, phone numbers,
# emails, or other sensitive personal data. Please remove or anonymize any
# personal information before resubmitting."
Key Behavior:
  • Agent receives feedback and retries
  • Model re-executes with feedback injected
  • Loop continues until policy passes or max retries reached
  • Falls back to block/modify action if still failing
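
Conceptually, the retry loop works like the sketch below. Here run_model, check_policy, make_feedback, and apply_fallback are illustrative stand-ins for Upsonic internals, not public APIs:
# Stand-ins for Upsonic internals (assumptions, not real APIs):
def run_model(messages: list[str]) -> str: ...
def check_policy(output: str) -> str | None: ...  # violation details, or None if compliant
def make_feedback(violation: str) -> str: ...
def apply_fallback(output: str) -> str: ...       # the policy's block/modify action

def do_with_feedback(description: str, max_retries: int = 3) -> str:
    messages = [description]
    output = run_model(messages)                   # initial generation
    for _ in range(max_retries):
        violation = check_policy(output)
        if violation is None:
            return output                          # compliant output: done
        messages.append(make_feedback(violation))  # feedback injected as a user message
        output = run_model(messages)               # model re-executes with the feedback
    if check_policy(output) is not None:
        output = apply_fallback(output)            # still failing: fall back to block/modify
    return output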

Combined User and Agent Policies

Protect both input and output with feedback enabled:
from upsonic import Agent, Task
from upsonic.safety_engine.policies.crypto_policies import CryptoBlockPolicy
from upsonic.safety_engine.policies.pii_policies import PIIBlockPolicy

agent = Agent(
    model="openai/gpt-4o-mini",
    # User input protection
    user_policy=CryptoBlockPolicy,
    user_policy_feedback=True,
    user_policy_feedback_loop=1,
    # Agent output protection
    agent_policy=PIIBlockPolicy,
    agent_policy_feedback=True,
    agent_policy_feedback_loop=2,
    debug=True  # See feedback generation in action
)

task = Task(
    description="Create a customer profile for someone interested in investment strategies"
)

result = agent.do(task)
# Flow: User input checked → Agent generates → Output checked → Retry if needed

Real-World Example: Customer Support Agent

from upsonic import Agent, Task
from upsonic.safety_engine.policies.pii_policies import PIIBlockPolicy
from upsonic.safety_engine.base import RuleBase, ActionBase, Policy
from upsonic.safety_engine.models import PolicyInput, RuleOutput, PolicyOutput
from typing import Optional, Dict, Any
import re

# 1. Define custom rule for company secrets
class CompanySecretRule(RuleBase):
    """Detects sensitive company information"""
    
    name = "Company Secret Rule"
    description = "Detects confidential company terms"
    language = "en"
    
    def __init__(self, options: Optional[Dict[str, Any]] = None):
        super().__init__(options)
        self.keywords = [
            "confidential", "internal strategy", "trade secret",
            "proprietary", "classified", "restricted"
        ]
    
    def process(self, policy_input: PolicyInput) -> RuleOutput:
        combined_text = " ".join(policy_input.input_texts or []).lower()
        triggered = []
        
        for keyword in self.keywords:
            pattern = r'\b' + re.escape(keyword.lower()) + r'\b'
            if re.search(pattern, combined_text):
                triggered.append(keyword)
        
        if not triggered:
            return RuleOutput(
                confidence=0.0,
                content_type="SAFE",
                details="No confidential content detected"
            )
        
        return RuleOutput(
            confidence=1.0,
            content_type="COMPANY_SECRET",
            details=f"Found {len(triggered)} confidential terms",
            triggered_keywords=triggered
        )

# 2. Define custom action
class CompanySecretAction(ActionBase):
    """Blocks company confidential content"""
    
    name = "Company Secret Action"
    description = "Blocks confidential company information"
    language = "en"
    
    def action(self, rule_result: RuleOutput) -> PolicyOutput:
        if rule_result.confidence >= 0.8:
            return self.raise_block_error(
                "Company confidential information detected and blocked."
            )
        return self.allow_content()

# 3. Create the policy
company_secrets_policy = Policy(
    name="Company Secrets Policy",
    description="Protects confidential company information",
    rule=CompanySecretRule(),
    action=CompanySecretAction()
)
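
# (Optional) Sanity-check the rule in isolation -- this assumes PolicyInput
# accepts input_texts as a keyword argument, matching process() above:
#   out = CompanySecretRule().process(PolicyInput(input_texts=["this is a trade secret"]))
#   print(out.content_type)  # "COMPANY_SECRET"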

# 4. Create the support agent with both policies
agent = Agent(
    model="openai/gpt-4o-mini",
    name="Customer Support Agent",
    # Block sensitive user queries
    user_policy=company_secrets_policy,
    user_policy_feedback=True,
    user_policy_feedback_loop=1,
    # Ensure agent doesn't leak PII in responses
    agent_policy=PIIBlockPolicy,
    agent_policy_feedback=True,
    agent_policy_feedback_loop=2,
    debug=True
)

# Scenario 1: User asks about sensitive info
task1 = Task(
    description="What is our internal strategy for Q4 product launches?"
)
result1 = agent.do(task1)
# Returns: Helpful feedback explaining why the question can't be answered

# Scenario 2: Agent needs to generate customer data
task2 = Task(
    description="Generate a sample customer support ticket with customer contact information"
)
result2 = agent.do(task2)
# Agent retries until output is compliant (no real PII)

Configuration Parameters in the Agent Class

Parameter                    Type   Default   Description
user_policy_feedback         bool   False     Enable feedback for user policy violations
agent_policy_feedback        bool   False     Enable the feedback loop for agent policy violations
user_policy_feedback_loop    int    1         Max attempts to generate user feedback
agent_policy_feedback_loop   int    1         Max retry attempts for the agent feedback loop
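
For reference, here is an Agent configured with those defaults made explicit (policy class as in the earlier examples):
from upsonic import Agent
from upsonic.safety_engine.policies.pii_policies import PIIBlockPolicy

agent = Agent(
    model="openai/gpt-4o-mini",
    agent_policy=PIIBlockPolicy,
    agent_policy_feedback=False,   # default: no feedback loop, violations are blocked/modified
    agent_policy_feedback_loop=1,  # default: a single retry attempt once feedback is enabled
)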

Best Practices

  1. Start with 1-2 retries: More retries = more LLM calls = higher cost
  2. Use debug=True during development: See feedback generation in action
  3. Monitor costs: Each feedback generation and retry involves LLM calls (see the timing sketch after this list)
  4. Set appropriate fallback actions: Ensure policies have proper actions (BLOCK, REPLACE, ANONYMIZE) for when loops are exhausted
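
Because every feedback round adds one feedback-generation call plus one model re-execution, it is worth watching attempt counts and latency during development. A small sketch using only standard Python (the timing wrapper is illustrative, not an Upsonic feature):
import time

from upsonic import Agent, Task
from upsonic.safety_engine.policies.pii_policies import PIIBlockPolicy

agent = Agent(
    model="openai/gpt-4o-mini",
    agent_policy=PIIBlockPolicy,
    agent_policy_feedback=True,
    agent_policy_feedback_loop=2,
    debug=True,  # prints feedback generation as it happens
)

def timed_do(task: Task) -> str:
    # Wall-clock time is a rough proxy for retries: each round adds
    # one feedback generation plus one model re-execution.
    start = time.perf_counter()
    result = agent.do(task)
    print(f"completed in {time.perf_counter() - start:.1f}s")
    return result

timed_do(Task(description="Generate a sample support ticket"))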

How It Works

User Policy Flow:

  User Input → Policy Check → Violation Detected
      Feedback enabled?
        Yes → Generate LLM feedback → Return helpful message to user
        No  → Block/modify content

Agent Policy Flow:

  Agent Output → Policy Check → Violation Detected
      Feedback enabled and retries left?
        Yes → Generate LLM feedback → Inject feedback as user message
              → Re-execute model → Check policy again (repeat)
        No  → Apply block/modify action

  The loop repeats until the output passes or retries are exhausted.