01_AGENT_SYSTEM

Shared from "Study" on Inkdown

Agent System - Comprehensive Deep Dive

Overview

The Agent system is the heart of the OpenAI Agents SDK. An Agent represents an AI assistant that can be configured with instructions, tools, guardrails, handoffs, and more. Think of an Agent as a "persona" or "role" that the LLM adopts, equipped with specific capabilities and constraints.

Core Classes

AgentBase

AgentBase is the base class for all agents. It provides the foundational attributes that are shared across different agent types (including Agent and RealtimeAgent).

instructions: str | Callable | None - The system prompt for the agent. This is the most important attribute as it defines the agent's behavior, personality, and capabilities. It can be:
- A static string
- A function that dynamically generates instructions based on context
- None (no specific instructions)
prompt: Prompt | DynamicPromptFunction | None - A more advanced way to configure prompts using OpenAI's Prompt API. This allows dynamic configuration of instructions, tools, and other settings outside of your code. Only usable with OpenAI models using the Responses API.
handoffs: list[Agent | Handoff] - A list of sub-agents or handoff configurations that this agent can delegate to. This enables multi-agent workflows where specialized agents handle specific tasks.
model: str | Model | None - The model to use for this agent. If not specified, it uses the default model (currently "gpt-4.1"). You can specify:
- A string model name (e.g., "gpt-4o")
- A custom Model instance
model_settings: ModelSettings - Model-specific tuning parameters like temperature, top_p, max tokens, etc. These control the randomness and creativity of the model's responses.
input_guardrails: list[InputGuardrail] - Guardrails that run before the agent processes input. These are safety checks that can validate, filter, or reject input before it reaches the LLM.
output_guardrails: list[OutputGuardrail] - Guardrails that run after the agent produces output. These validate the final output to ensure it meets safety or quality standards.
output_type: type | AgentOutputSchemaBase | None - The expected type of the output. If not specified, output is a string. You can specify:
- A Python type (dataclass, Pydantic model, TypedDict, etc.)
- A custom AgentOutputSchemaBase for custom JSON schemas
- AgentOutputSchema with strict_json_schema=False for non-strict schemas
hooks: AgentHooks | None - A class that receives callbacks on various lifecycle events for this specific agent. This allows you to hook into the agent's execution to add custom logic.
tool_use_behavior: Literal | StopAtTools | Callable - Controls how tool use is handled:
- "run_llm_again" (default) - Tools are executed, then the LLM receives the results and can respond again
- "stop_on_first_tool" - The first tool's output is treated as the final result
- StopAtTools dict - Stop if specific tools are called
- Custom function - Fine-grained control over tool-to-output logic
reset_tool_choice: bool - Whether to reset tool choice after a tool call. Defaults to True to prevent infinite loops of tool usage.

Python

research_agent = Agent(name="researcher", instructions="Research information...")
main_agent = Agent(
    name="coordinator",
    instructions="Coordinate tasks",
    tools=[research_agent.as_tool("research_tool", "Research a topic")]
)

Python

from dataclasses import dataclass
from agents import Agent, function_tool, RunContextWrapper

@dataclass
class MyContext:
    user_id: str
    session_data: dict

@function_tool
def get_user_preferences(context: RunContextWrapper[MyContext]) -> str:
    user_id = context.context.user_id
    # Fetch preferences for this user
    return f"Preferences for {user_id}"

agent = Agent(
    name="personalized_agent",
    instructions="Help the user with personalized recommendations",
    tools=[get_user_preferences],
)

# When running
context = MyContext(user_id="123", session_data={})
result = await Runner.run(agent, "What do you recommend?", context=context)

Python

def dynamic_instructions(
    context: RunContextWrapper[MyContext],
    agent: Agent[MyContext],
) -> str:
    user_id = context.context.user_id
    return f"You are helping user {user_id}. Be friendly and helpful."

agent = Agent(
    name="dynamic_agent",
    instructions=dynamic_instructions,
)

Python

from agents import Agent, function_tool

@function_tool
def calculate_sum(a: int, b: int) -> int:
    return a + b

agent = Agent(
    name="math_agent",
    instructions="Help with math calculations",
    tools=[calculate_sum],
)

Python

from agents import Agent, function_tool, tool_namespace

@tool_namespace("math")
@function_tool
def add(a: int, b: int) -> int:
    return a + b

@tool_namespace("math")
@function_tool
def multiply(a: int, b: int) -> int:
    return a * b

agent = Agent(
    name="calculator",
    instructions="Use math tools for calculations",
    tools=[add, multiply],
)

Python

@function_tool
def admin_function(context: RunContextWrapper[MyContext]) -> str:
    return "Admin-only function"

agent = Agent(
    name="admin_agent",
    instructions="Admin functions",
    tools=[admin_function],
)

# The tool can have an is_enabled function
admin_function.is_enabled = lambda ctx, agent: ctx.context.user_id == "admin"

Python

from agents import Agent, handoff

specialist_agent = Agent(
    name="specialist",
    instructions="Handle specialized tasks",
    handoff_description="Specialist for complex technical issues",
)

general_agent = Agent(
    name="general",
    instructions="Handle general queries",
    handoffs=[handoff(specialist_agent)],
)

Python

def filter_handoff_input(data: HandoffInputData) -> HandoffInputData:
    # Only pass the last 5 items
    return data.clone(new_items=data.new_items[-5:])

agent = Agent(
    name="general",
    instructions="General agent",
    handoffs=[handoff(specialist_agent, input_filter=filter_handoff_input)],
)

Python

from agents import Agent, input_guardrail, InputGuardrailFunctionOutput

@input_guardrail
def check_off_topic(
    context: RunContextWrapper,
    agent: Agent,
    input: str | list[TResponseInputItem],
) -> InputGuardrailFunctionOutput:
    if isinstance(input, str) and "politics" in input.lower():
        return InputGuardrailFunctionOutput(
            output_info="Political content detected",
            tripwire_triggered=True,
        )
    return InputGuardrailFunctionOutput(
        output_info="Input is appropriate",
        tripwire_triggered=False,
    )

agent = Agent(
    name="safe_agent",
    instructions="Help with safe topics",
    input_guardrails=[check_off_topic],
)

Python

from agents import Agent, output_guardrail

@output_guardrail
def check_output_length(
    context: RunContextWrapper,
    agent: Agent,
    agent_output: str,
) -> InputGuardrailFunctionOutput:
    if len(agent_output) > 1000:
        return InputGuardrailFunctionOutput(
            output_info="Output too long",
            tripwire_triggered=True,
        )
    return InputGuardrailFunctionOutput(
        output_info="Output length acceptable",
        tripwire_triggered=False,
    )

agent = Agent(
    name="concise_agent",
    instructions="Be concise",
    output_guardrails=[check_output_length],
)

Python

from agents import Agent, ModelSettings

agent = Agent(
    name="creative_agent",
    instructions="Be creative",
    model_settings=ModelSettings(
        temperature=0.9,  # More creative
        top_p=0.9,
    ),
)

Python

from dataclasses import dataclass
from agents import Agent

@dataclass
class Summary:
    title: str
    points: list[str]

agent = Agent(
    name="summarizer",
    instructions="Summarize the input",
    output_type=Summary,
)
# Output is a Summary dataclass instance

Python

from agents import Agent, AgentOutputSchema

custom_schema = AgentOutputSchema(
    type="object",
    properties={
        "answer": {"type": "string"},
        "confidence": {"type": "number"},
    },
)

agent = Agent(
    name="structured_agent",
    instructions="Provide structured answers",
    output_type=custom_schema,
)

Python

from agents import Agent, StopAtTools

agent = Agent(
    name="controlled_agent",
    instructions="Use tools carefully",
    tool_use_behavior=StopAtTools(stop_at_tool_names=["sensitive_tool"]),
)
# Stops if sensitive_tool is called

Python

from agents import Agent, ToolsToFinalOutputResult, ToolsToFinalOutputFunction

async def custom_tool_handler(
    context: RunContextWrapper,
    tool_results: list[FunctionToolResult],
) -> ToolsToFinalOutputResult:
    # Custom logic to decide if tool results are final
    if any("error" in r.output for r in tool_results):
        return ToolsToFinalOutputResult(is_final_output=True, final_output="Error occurred")
    return ToolsToFinalOutputResult(is_final_output=False)

agent = Agent(
    name="custom_agent",
    instructions="Custom tool handling",
    tool_use_behavior=custom_tool_handler,
)

Python

from agents import Agent, AgentHooks

class MyAgentHooks(AgentHooks):
    async def on_start(self, context, agent):
        print(f"Agent {agent.name} starting")
    
    async def on_end(self, context, agent, output):
        print(f"Agent {agent.name} finished with output: {output}")

agent = Agent(
    name="hooked_agent",
    instructions="Agent with hooks",
    hooks=MyAgentHooks(),
)

Python

# Good
agent = Agent(
    name="summarizer",
    instructions="You are a summarization expert. Given a text, provide a concise summary in 3 bullet points. Each bullet should be under 20 words.",
)

# Avoid
agent = Agent(
    name="vague_agent",
    instructions="Summarize things",  # Too vague
)

Python

# Good - focused context
@dataclass
class UserContext:
    user_id: str
    preferences: dict

# Avoid - bloated context
@dataclass
class EverythingContext:
    user_id: str
    preferences: dict
    session_history: list
    cache: dict
    database_connection: Any  # Don't put heavy resources here

Python

# Input guardrail - for filtering what comes in
@input_guardrail
def check_safety(input: str) -> GuardrailFunctionOutput:
    ...

# Output guardrail - for validating what goes out
@output_guardrail
def check_quality(output: str) -> GuardrailFunctionOutput:
    ...

# Tool guardrail - for specific tool validation
@tool_input_guardrail
def check_tool_args(args: dict) -> ToolGuardrailFunctionOutput:
    ...

Python

coder = Agent(name="coder", instructions="Write code", ...)
researcher = Agent(name="researcher", instructions="Research topics", ...)
writer = Agent(name="writer", instructions="Write content", ...)

coordinator = Agent(
    name="coordinator",
    instructions="Coordinate tasks",
    handoffs=[handoff(coder), handoff(researcher), handoff(writer)],
)

Python

# First agent processes input
agent1 = Agent(name="step1", instructions="Process step 1")

# Second agent takes output of first
agent2 = Agent(name="step2", instructions="Process step 2")

result1 = await Runner.run(agent1, input)
result2 = await Runner.run(agent2, result1.final_output)

Python

import asyncio

agent1 = Agent(name="parallel1", instructions="Task 1")
agent2 = Agent(name="parallel2", instructions="Task 2")

results = await asyncio.gather(
    Runner.run(agent1, input),
    Runner.run(agent2, input),
)

Python

from agents import InputGuardrailTripwireTriggered, OutputGuardrailTripwireTriggered

try:
    result = await Runner.run(agent, input)
except InputGuardrailTripwireTriggered as e:
    print(f"Input guardrail triggered: {e}")
except OutputGuardrailTripwireTriggered as e:
    print(f"Output guardrail triggered: {e}")

Python

base_agent = Agent(
    name="base",
    instructions="Base instructions",
    tools=[base_tool],
)

# Create variations
agent_a = base_agent.clone(name="agent_a", instructions="A-specific instructions")
agent_b = base_agent.clone(name="agent_b", instructions="B-specific instructions")

Python

from agents import Agent, MCPServerStdio

mcp_server = MCPServerStdio(params=MCPServerStdioParams(...))

agent = Agent(
    name="mcp_agent",
    instructions="Use MCP tools",
    mcp_servers=[mcp_server],
)

01_AGENT_SYSTEM

Agent System - Comprehensive Deep Dive

Overview

Core Classes

AgentBase

01_AGENT_SYSTEM

Agent System - Comprehensive Deep Dive

Overview

Core Classes

AgentBase

Agent

Agent Lifecycle

1. Creation

2. Execution

3. Cloning

Agents as Tools

Context and Generics

Dynamic Instructions

Tool Management

Adding Tools

Tool Namespaces

Dynamic Tool Enablement

Handoff Management

Adding Handoffs

Handoff Input Filters

Guardrails

Input Guardrails

Output Guardrails

Model Configuration

Model Selection

Model Settings

GPT-5 Special Handling

Output Types

String Output (Default)

Structured Output

Custom Schema

Tool Use Behavior

Default Behavior (run_llm_again)

Stop on First Tool

Stop at Specific Tools

Custom Behavior

Lifecycle Hooks

Best Practices

1. Clear Instructions

2. Tool Naming

3. Handoff Descriptions

4. Context Design

5. Guardrail Granularity

Common Patterns

1. Specialist Pattern

2. Supervisor Pattern

3. Sequential Pattern

4. Parallel Pattern

Error Handling in Agents

Model Behavior Errors

Tool Errors

Guardrail Tripwires

Advanced Topics

Agent Cloning for Variations

Dynamic Tool Addition

MCP Integration

Summary