Skip to main content

What’s an AI Agent?

They are programs that are capable of creating intelligent outputs for any given task. Traditional software is designed for one specific thing and follows a sequential flow. Agents, however, dynamically prepare themselves to perform tasks and determine their own needs and actions. They are like general Machine Learning (ML) models, meaning you can use them for a variety of tasks. What do they do?
  • Understand requests
  • Classify and extract any kind of data (text, image, audio, and video)
  • Make decisions and generate intelligent outputs (as text, image, audio, or other output)
  • Connect to other tools and data to complete or assist operations (e.g., making requests to APIs, retrieving data, performing operations)
What are the parts of an agent?
  • Characterization: This is the most important part. Agents are like 1000 different experts, but for any given task, we only need a few of them. For this reason, we characterize them specifically for our tasks.
  • Reasoning: When agents engage in more extensive thinking, it leads to more accurate answers. Reasoning tools analyze operations, create plans, analyze tool and agent outputs, make revisions, and replan. We offer pre-built reasoning and thinking tools that you can enable in the required agents.
  • Memory: Just like humans, agents have the ability to store and retrieve information from previous operations. This allows them to customize and specify themselves according to user preferences, providing personalized answers.
  • Knowledge Base: In many cases, we have a lot of data to improve their answers. Sometimes we want to provide extensive information and then ask for specific parts of it. In such cases, we perform RAG (Retrieval Augmented Generation), and we have special search agents that we call Agentic Search for this feature.
  • Storage: We provide integrations to local and cloud storage options for storing memories and knowledge bases.

Core Principles For Agents

When you creating an an AI Agent, ensure that you define these elements as well:
  • Role: You need to give an role defination to your agent that match with the task of the agent as well.
  • Goal: Agents needs some goals to focus on the thing that they are doing right now.
  • Instructions: You need to give some explainations for your agents to keep them that making the right think in right time.

Defining Roles

The role will explain agent like you are the expert for that. But there is some points to defining a good role:
  • Be Specific: Dont write Operation Specialist, write: Onboarding Specialist
  • Add Domain information: Add the domain information for make your agent more aware (e.g., “Onboarding Specialist in fintech industry”)
  • Roles should be real worlds professions: Your agents knows the real worlds professions. So please try to choose them.
Good Role:
role="Onboarding Specialist in fintech industry"
role="CFO in fintech industry and market is EU"
role="Head Of Development in Health Tech"
Bad Role:
role="Operation Specialist"
role="Finance Expert"
role="Developer"

Defining Goals

Agents are trying to complete their goals while making their tasks. So they are important for the overall requests.
  • Write the most important points for you: Like, care about the EU standards
  • Write the things which is the important for this role at your company
  • Design a Perfect Employee actually.
Good Goals:
goal="Provide actionable insights to business teams by analyzing product trends on merchant websites"
goal="Ensure financial reports for EU fintech startups are complete and fully GDPR compliant"
goal="Maintain up-to-date API documentation that consistently meets ISO 9001 standards"
Bad Goals:
goal="Analyze products"
goal="Prepare financial reports"
goal="Write documentation"

Defining Instructions

Instructions guide your agent to act correctly and efficiently during its tasks. When writing instructions:
  • Include specific details relevant to the agent’s responsibilities.
  • Highlight the important points the agent should always consider while performing tasks.
  • Use clear, actionable language to avoid ambiguity.
Good Instructions:
instructions="Always check for GDPR compliance when handling user data."
instructions="Summarize product trends in bullet points for easy review."
instructions="Use WebSearch before WebRead to ensure you have the most up-to-date information."
Bad Instructions:
instructions="Be careful."
instructions="Analyze the data."
instructions="Do your best."
If you want to learn everything about Agent Concepts use here.

Let’s create an Agent that analyzes Merchant Websites

At Upsonic, we strive for simplicity and safety. With this in mind, we offer an Agent class that allows us to configure our Agent for specific purposes. In this example, we will create an agent that identifies merchant websites and analyzes their product offerings.
# Upsonic Docs: Create an Agent
# https://docs.upsonic.ai/guides/create_an_agent



# Imports
from upsonic import Agent


# Agent Creation
merchant_analyzer_agent = Agent(
    name="Merchant Analyzer V1",
    role="Analyzing the merchant websites.",
    goal="Getting the right result on their websites and giving as the user requested format",
    instructions="""
    Identify and analyze the product's category and brand.
    If the product belongs to a series, pay special attention to series details and related products.
    Summarize product trends in clear bullet points for easy review.
    Prioritize up-to-date information and highlight any unusual patterns or outliers.
    """
)

Need more advanced features?

The Agent framework provides several advanced configurations to tailor your agent’s behavior:
  • Multi-Agent Collaboration: Design teams of agents that collaborate on complex tasks, each with specialized roles and goals.
  • Reasoning Enhancements: Incorporate advanced reasoning tools that analyze operations, create plans, and make revisions to improve decision-making.
  • Memory Integration: Equip agents with memory capabilities to store and retrieve information from previous interactions, enabling personalized responses.
  • Knowledge Base Access: Connect agents to extensive knowledge bases for Retrieval Augmented Generation (RAG), enhancing information retrieval and accuracy.
  • Tool Integration: Extend agent functionality by integrating custom tools that interact with external systems and perform specialized operations.
  • Context Compression: Implement context compression strategies (simple, LLMLingua) to manage large conversation histories efficiently.
  • Safety Policies: Apply user and agent policies to ensure safe and compliant interactions with built-in guardrails.
  • Streaming Capabilities: Enable real-time streaming responses for interactive applications and better user experience.
  • Reflection Processing: Integrate self-evaluation and reflection capabilities to improve response quality over time.
For detailed examples and advanced patterns, see our comprehensive Agent Concept Documentation.
I