02_RUNNER_SYSTEM

Shared from "Study" on Inkdown

Runner System - Comprehensive Deep Dive

Overview

The Runner system is the execution engine of the OpenAI Agents SDK. It's responsible for orchestrating agent runs, managing the lifecycle of agent execution, handling tool execution, coordinating handoffs, managing sessions, and ensuring proper error handling. Think of the Runner as the "director" that brings together all the components (agents, tools, guardrails, etc.) and makes them work together in a coordinated way.

Core Classes

Runner

Runner is the main entry point for executing agents. It provides both async and sync interfaces for running agents.

Location: src/agents/run.py

Python

# Pseudocode of turn execution
for turn in range(max_turns):
    # 1. Prepare input
    input_items = prepare_input(current_state, new_input)
    
    # 2. Run input guardrails (first turn only, starting agent only)
    if turn == 0 and is_starting_agent:
        guardrail_results = await run_input_guardrails(input_items)
        if any guardrail.tripwire_triggered:
            raise InputGuardrailTripwireTriggered
    
    # 3. Call the model
    response = await model.get_response(
        instructions=agent.instructions,
        input=input_items,
        tools=available_tools,
        ...
    )
    
    # 4. Process the response
    processed = process_response(response)
    
    # 5. Handle tool calls
    if processed.tool_calls:
        tool_results = await execute_tools(processed.tool_calls)
        # Add tool results to state for next turn
        state.add_tool_results(tool_results)
        continue to next turn
    
    # 6. Handle handoffs
    if processed.handoff:
        next_agent = processed.handoff.target_agent
        switch_to_agent(next_agent)
        continue to next turn
    
    # 7. Final output
    if processed.final_output:
        # Run output guardrails
        guardrail_results = await run_output_guardrails(processed.final_output)
        if any guardrail.tripwire_triggered:
            raise OutputGuardrailTripwireTriggered
        
        return RunResult(final_output=processed.final_output, ...)

String input - Simple text input

Python

await Runner.run(agent, "Hello")
# Converted to: [{"type": "user", "content": "Hello"}]

List input - Structured input with multiple items

Python

await Runner.run(agent, [
    {"type": "user", "content": "Hello"},
    {"type": "user", "content": {"type": "image_url", "image_url": "..."}}
])

RunState input - Resume from a paused state

Python

state = previous_run.to_state()
await Runner.run(agent, state)  # Resumes from where it left off

Python

model_response = await model.get_response(
    system_instructions=agent.instructions,
    input=input_items,
    model_settings=resolved_model_settings,
    tools=available_tools,
    output_schema=agent.output_type,
    handoffs=agent.handoffs,
    tracing=tracing_config,
    previous_response_id=previous_response_id,
    conversation_id=conversation_id,
    prompt=agent.prompt,
)

Python

@dataclass
class ProcessedResponse:
    content: str | None
    tool_calls: list[ToolCall]
    handoff: Handoff | None
    reasoning: ReasoningItem | None
    refusal: str | None
    raw_response: ModelResponse

Python

for tool_call in processed_response.tool_calls:
    # 1. Find the tool
    tool = find_tool(tool_call.name)
    
    # 2. Check if approval is needed
    if tool.needs_approval:
        approval = await request_approval(tool_call)
        if not approval:
            record_rejection(tool_call)
            continue
    
    # 3. Run tool guardrails (if configured)
    guardrail_result = await run_tool_input_guardrail(tool, tool_call.arguments)
    if guardrail_result.tripwire_triggered:
        handle_guardrail_tripwire(guardrail_result)
        continue
    
    # 4. Execute the tool
    try:
        result = await tool.execute(tool_call.arguments, context)
    except Exception as e:
        result = handle_tool_error(e, tool)
    
    # 5. Run tool output guardrails (if configured)
    guardrail_result = await run_tool_output_guardrail(tool, result)
    if guardrail_result.tripwire_triggered:
        handle_guardrail_tripwire(guardrail_result)
        result = guardrail_result.output_info
    
    # 6. Record the result
    record_tool_output(tool_call, result)

Python

if processed_response.handoff:
    handoff = processed_response.handoff
    
    # 1. Invoke the handoff
    next_agent = await handoff.on_invoke_handoff(context, handoff_arguments)
    
    # 2. Apply input filter (if configured)
    if handoff.input_filter:
        handoff_input = build_handoff_input(current_state, handoff)
        filtered_input = await handoff.input_filter(handoff_input)
    else:
        filtered_input = default_handoff_input(current_state, handoff)
    
    # 3. Switch to the new agent
    current_agent = next_agent
    
    # 4. Continue with next turn
    continue

Python

if processed_response.final_output:
    # Run output guardrails
    guardrail_results = []
    for guardrail in agent.output_guardrails + (run_config.output_guardrails or []):
        result = await guardrail.run(context, agent, processed_response.final_output)
        guardrail_results.append(result)
        
        if result.output.tripwire_triggered:
            raise OutputGuardrailTripwireTriggered(
                guardrail_result=result,
                output=processed_response.final_output,
            )
    
    # Return the result
    return RunResult(
        final_output=processed_response.final_output,
        output_guardrail_results=guardrail_results,
        ...
    )

Python

@dataclass
class RunResult:
    final_output: Any
    new_items: list[RunItem]
    raw_responses: list[ModelResponse]
    last_agent: Agent
    input_guardrail_results: list[InputGuardrailResult]
    output_guardrail_results: list[OutputGuardrailResult]
    tool_input_guardrail_results: list[ToolInputGuardrailResult]
    tool_output_guardrail_results: list[ToolOutputGuardrailResult]
    trace: Trace | None
    usage: Usage
    interruptions: list[ToolApprovalItem]
    ...

Python

async for event in Runner.run_streamed(agent, input):
    if isinstance(event, RunItemStreamEvent):
        print(f"Item: {event.item}")
    elif isinstance(event, AgentUpdatedStreamEvent):
        print(f"Agent updated: {event.agent.name}")
    elif isinstance(event, RawResponsesStreamEvent):
        print(f"Raw event: {event.event}")

Python

from agents import RunErrorHandlers

error_handlers = RunErrorHandlers(
    max_turns=lambda ctx, error: "Custom max turns message"
)

result = await Runner.run(
    agent,
    input,
    error_handlers=error_handlers,
)

Python

try:
    result = await Runner.run(agent, input)
except MaxTurnsExceeded as e:
    # Can resume with increased max_turns
    state = e.run_state
    result = await Runner.run(agent, state, max_turns=20)
except InputGuardrailTripwireTriggered as e:
    # Can retry with modified input
    result = await Runner.run(agent, modified_input)

Python

from agents import RunConfig, ModelSettings

config = RunConfig(
    model="gpt-4o",
    model_settings=ModelSettings(temperature=0.7),
    max_turns=20,
    tracing_disabled=False,
    workflow_name="My workflow",
)

result = await Runner.run(agent, input, run_config=config)

Python

from agents import RunHooks

class MyRunHooks(RunHooks):
    async def on_llm_start(self, context, agent, system_prompt, input_items):
        print(f"LLM call starting for {agent.name}")
    
    async def on_llm_end(self, context, agent, response):
        print(f"LLM call completed for {agent.name}")
    
    async def on_agent_start(self, context, agent):
        print(f"Agent {agent.name} starting")
    
    async def on_agent_end(self, context, agent, output):
        print(f"Agent {agent.name} finished")
    
    async def on_handoff(self, context, from_agent, to_agent):
        print(f"Handoff from {from_agent.name} to {to_agent.name}")
    
    async def on_tool_start(self, context, agent, tool):
        print(f"Tool {tool.name} starting")
    
    async def on_tool_end(self, context, agent, tool, result):
        print(f"Tool {tool.name} finished")

result = await Runner.run(agent, input, hooks=MyRunHooks())

Python

# First run - pauses for approval
result = await Runner.run(agent, input)
if result.interruptions:
    state = result.to_state()
    
    # Human reviews
    for interruption in result.interruptions:
        if should_approve(interruption):
            state.context.approve_tool(interruption)
        else:
            state.context.reject_tool(interruption)
    
    # Resume
    result = await Runner.run(agent, state)

Python

config = RunConfig(
    tracing_disabled=False,
    trace_include_sensitive_data=True,  # Include tool inputs/outputs
    workflow_name="My workflow",
    trace_id="custom-id",
    group_id="conversation-123",
    trace_metadata={"user_id": "123"},
)

Python

# These will execute in parallel
@function_tool
def tool1() -> str:
    time.sleep(1)
    return "tool1"

@function_tool
def tool2() -> str:
    time.sleep(1)
    return "tool2"

agent = Agent(tools=[tool1, tool2])

Python

try:
    result = await Runner.run(agent, input)
except MaxTurnsExceeded:
    # Handle gracefully
    return "I need more information to complete this task."
except InputGuardrailTripwireTriggered:
    # Handle gracefully
    return "I cannot process that request."

Python

async for event in Runner.run_streamed(agent, input):
    if isinstance(event, RunItemStreamEvent):
        if isinstance(event.item, MessageOutputItem):
            print(event.item.content[0].text)

02_RUNNER_SYSTEM

Runner System - Comprehensive Deep Dive

Overview

Core Classes

Runner

02_RUNNER_SYSTEM

Runner System - Comprehensive Deep Dive

Overview

Core Classes

Runner

AgentRunner

Execution Flow

1. Initialization

2. Turn Execution

3. Input Preparation

4. Model Call

5. Response Processing

6. Tool Execution

7. Handoff Execution

8. Output Guardrails

9. Session Persistence

10. Result Return

Streaming

Streamed Execution

Error Handling

Error Handlers

Error Types

Error Recovery

Run Configuration

RunConfig

Configuration Priority

Lifecycle Hooks

Run Hooks

Server-Managed Conversations

Human-in-the-Loop

Approval Workflow

Tracing Integration

Trace Creation

Usage Tracking

Token Usage

Performance Considerations

Async Execution

Parallel Tool Execution

Session Compaction

Best Practices

1. Use Async

2. Set Reasonable Max Turns

3. Use Sessions for Long Conversations

4. Enable Tracing for Debugging

5. Handle Errors Gracefully

Common Patterns

1. Multi-Agent Workflow

2. Streaming Response

3. Resume from State

4. Custom Error Handling

Summary