03-query-system

Query System Architecture

Overview

The Query System is the heart of Claude Code - it manages the conversation with Claude's API, handles tool execution, and orchestrates the entire interaction flow.

Plain text

┌─────────────────────────────────────────────────────────────────────────────┐
│                           QUERY SYSTEM                                       │
├─────────────────────────────────────────────────────────────────────────────┤
│                                                                             │
│  ┌──────────────┐    ┌──────────────┐    ┌──────────────┐                │
│  │   Query      │───►│   Query      │───►│   Claude     │                │
│  │   Engine     │    │   (query.ts) │    │   API        │                │
│  │   (class)    │◄───│              │◄───│              │                │
│  └──────────────┘    └──────────────┘    └──────────────┘                │
│         │                   │                                               │
│         │            ┌──────▼──────┐                                     │
│         │            │  Streaming  │                                     │
│         │            │  Executor   │                                     │
│         │            └──────┬──────┘                                     │
│         │                   │                                               │
│         │            ┌──────▼──────┐                                     │
│         └───────────►│   Tool      │                                     │
│                      │  Execution  │                                     │
│                      └─────────────┘                                     │
│                                                                             │
└─────────────────────────────────────────────────────────────────────────────┘

┌─────────────────────────────────────────────────────────────────────────────┐ │ QUERY SYSTEM │ ├─────────────────────────────────────────────────────────────────────────────┤ │ │ │ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │ │ │ Query │───►│ Query │───►│ Claude │ │ │ │ Engine │ │ (query.ts) │ │ API │ │ │ │ (class) │◄───│ │◄───│ │ │ │ └──────────────┘ └──────────────┘ └──────────────┘ │ │ │ │ │ │ │ ┌──────▼──────┐ │ │ │ │ Streaming │ │ │ │ │ Executor │ │ │ │ └──────┬──────┘ │ │ │ │ │ │ │ ┌──────▼──────┐ │ │ └───────────►│ Tool │ │ │ │ Execution │ │ │ └─────────────┘ │ │ │ └─────────────────────────────────────────────────────────────────────────────┘

File	Purpose
`QueryEngine.ts`	Main class managing query lifecycle
`query.ts`	Core streaming logic and API communication
`query/config.ts`	Query configuration building
`query/deps.ts`	Dependency injection for testing
`query/transitions.ts`	State machine transitions
`query/tokenBudget.ts`	Token limit management

┌─────────────────────────────────────────────────────────────────┐ │ QUERY LIFECYCLE │ ├─────────────────────────────────────────────────────────────────┤ │ │ │ 1. SUBMIT │ │ UserMessage → add to messages → call API │ │ │ │ 2. STREAM │ │ Receive chunks from Claude API │ │ ├─ Text blocks → yield message events │ │ ├─ Tool use blocks → queue for execution │ │ └─ Error blocks → handle errors │ │ │ │ 3. EXECUTE TOOLS │ │ For each ToolUseBlock: │ │ ├─ Check permissions │ │ ├─ Execute tool │ │ ├─ Collect results │ │ └─ Add ToolResultBlock to messages │ │ │ │ 4. CONTINUE? │ │ If more tool uses → go back to step 2 │ │ If max turns reached → stop with warning │ │ If completion → yield final result │ │ │ │ 5. COMPLETE │ │ Return final QueryResult │ │ │ └─────────────────────────────────────────────────────────────────┘

┌─────────────────────────────────────────────────────────────────┐ │ TOOL EXECUTION │ ├─────────────────────────────────────────────────────────────────┤ │ │ │ ┌──────────────┐ │ │ │ ToolUseBlock │ │ │ │ from API │ │ │ └──────┬───────┘ │ │ │ │ │ ▼ │ │ ┌──────────────┐ ┌──────────────┐ ┌──────────────┐ │ │ │ Find Tool │────►│ Check │────►│ Execute │ │ │ │ by Name │ │ Permissions │ │ Tool │ │ │ └──────────────┘ └──────────────┘ └──────┬───────┘ │ │ │ │ │ ┌─────────────────────┘ │ │ │ │ │ ▼ │ │ ┌─────────────────┐ │ │ │ Tool Progress │ │ │ │ (yield events) │ │ │ └────────┬────────┘ │ │ │ │ │ ▼ │ │ ┌─────────────────┐ │ │ │ ToolResultBlock│ │ │ │ (add to msgs) │ │ │ └─────────────────┘ │ │ │ └─────────────────────────────────────────────────────────────────┘

┌─────────────────────────────────────────────────────────────────┐ │ STREAMING FLOW │ ├─────────────────────────────────────────────────────────────────┤ │ │ │ API Response (SSE/Streaming) │ │ │ │ │ ▼ │ │ ┌─────────────┐ │ │ │ Raw Chunks │ "{\"type\": \"content_block_delta\"...}" │ │ └──────┬──────┘ │ │ │ │ │ ▼ │ │ ┌─────────────┐ │ │ │ Parse JSON │ │ │ └──────┬──────┘ │ │ │ │ │ ▼ │ │ ┌─────────────┐ ┌─────────────┐ ┌─────────────┐ │ │ │ Text Delta │────►│ Update │────►│ Yield to │ │ │ │ │ │ Assistant │ │ Generator │ │ │ └─────────────┘ │ Message │ └─────────────┘ │ │ └─────────────┘ │ │ │ │ ┌─────────────┐ ┌─────────────┐ ┌─────────────┐ │ │ │ Tool Use │────►│ Create │────►│ Yield │ │ │ │ Start │ │ ToolUseBlock│ │ Tool Event │ │ │ └─────────────┘ └─────────────┘ └─────────────┘ │ │ │ └─────────────────────────────────────────────────────────────────┘

┌─────────────────────────────────────────────────────────────────┐ │ CONTEXT WINDOW FLOW │ ├─────────────────────────────────────────────────────────────────┤ │ │ │ Input Messages │ │ │ │ │ ▼ │ │ ┌─────────┐ Too large? ┌─────────┐ │ │ │ Measure │───────────────►│ Compact │ │ │ │ Tokens │ │ (Summarize) │ │ └────┬────┘ └────┬────┘ │ │ │ No │ │ │ ▼ ▼ │ │ ┌─────────┐ ┌─────────┐ │ │ │ Send │ │ Send │ │ │ │ to API │ │ to API │ │ │ └─────────┘ └─────────┘ │ │ │ │ Compaction triggers: │ │ - Token count exceeds threshold (~80% of model limit) │ │ - User hits soft limit warning │ │ - Auto-compact enabled and threshold reached │ │ │ └─────────────────────────────────────────────────────────────────┘

Error Type	Strategy
Rate Limit	Wait and retry with backoff
Server Error	Retry with exponential backoff
Context Length	Compact and retry
Max Output	Truncate and continue
Invalid Tool	Return error to LLM

Query System Architecture

Overview

03-query-system

Query System Architecture

Overview

Core Files

Query Engine Class

Key Insight

Query Flow

Query Function (Streaming Core)

Tool Execution Flow

Tool Execution Code

Streaming Architecture

Why Generators?

Message Compilation

Context Window Management

Compaction Types

Error Handling

Recovery Strategies

Configuration Building

Testing the Query System

Key Concepts

1. Turn-Based

2. Recursive for Tool Loops

3. Streaming is Primary

4. Stateless Core

5. Compaction is Automatic

Related Documentation