11-testing-architecture

Testing Architecture

Overview

Claude Code has a multi-layered testing strategy that balances unit tests, integration tests, and "VCR" recorded API interactions. Given the complexity of LLM interactions, traditional mocking isn't enough - we need to record and replay actual API responses.

Plain text

┌─────────────────────────────────────────────────────────────────────────────┐
│                      TESTING PYRAMID                                       │
├─────────────────────────────────────────────────────────────────────────────┤
│                                                                             │
│                           ┌─────────┐                                      │
│                           │   E2E   │  Full CLI interactions               │
│                           │  Tests  │  (expensive, full API calls)          │
│                           └────┬────┘                                      │
│                                │                                           │
│                          ┌─────▼─────┐                                     │
│                          │   VCR     │  Recorded API interactions         │
│                          │  Tests    │  (deterministic, fast)             │
│                          └─────┬─────┘                                     │
│                                │                                           │
│                     ┌──────────▼──────────┐                               │
│                     │   Integration       │  Services, tools with mocks     │
│                     │      Tests          │                               │
│                     └──────────┬──────────┘                               │
│                                │                                           │
│              ┌─────────────────▼─────────────────┐                         │
│              │           Unit Tests            │  Pure functions, utils    │
│              │        (jest/bun test)          │                               │
│              └─────────────────────────────────┘                         │
│                                                                             │
└─────────────────────────────────────────────────────────────────────────────┘

Directory/File	Purpose
`tests/`	Test utilities, mocks, setup
`tests/mocks/`	Mock implementations
`tests/fixtures/`	VCR recordings, test data
`services/vcr.ts`	VCR recording/replay
`utils/testing/`	Test helpers
`*/__tests__/.test.ts`	Unit tests (co-located)

┌─────────────────────────────────────────────────────────────────┐ │ VCR FLOW │ ├─────────────────────────────────────────────────────────────────┤ │ │ │ RECORD MODE │ │ ──────────── │ │ │ │ Test calls API ──► Real API call ──► Save response │ │ │ │ │ │ ▼ ▼ │ │ Actual tokens To fixtures/ │ │ (costs $) test-name.json │ │ │ │ REPLAY MODE │ │ ──────────── │ │ │ │ Test calls API ──► Match request ──► Return recorded │ │ │ response │ │ ▼ │ │ Hash of: No API call │ │ - endpoint Deterministic │ │ - body Fast │ │ - headers │ │ │ └─────────────────────────────────────────────────────────────────┘

Testing Architecture

Overview

Core Testing Files

VCR Testing System

How VCR Works

VCR Implementation

Using VCR in Tests

VCR Fixtures

Mock System

Service Mocks

Tool Mocks

Context Mocks

Test Patterns

Unit Test Pattern

Integration Test Pattern

Query Engine Test with VCR

E2E Testing

CLI E2E Tests

Snapshot Testing

Test Data Builders

Running Tests

Test Isolation

Per-Test State Reset

Test Database/Files

Performance Testing

Key Testing Principles

Debugging Failed Tests