tldr

════════════════════════════════════════

Claude Mythos Clone Shocks Anthropic and OpenAI

AI Revolution · 12:55 · 20260421

What This Is Actually About

A 22-year-old reverse-engineered Anthropic's secretive Claude Mythos architecture into an open-source implementation called Open Mythos. The core innovation replaces deep stacked layers with a recurrent loop that reuses weights multiple times, challenging the assumption that bigger models require more parameters. Research shows this approach can match larger transformers with nearly half the parameters while enabling deeper reasoning during inference.

Key Points

Open Mythos Architecture

Kai Gomez built Open Mythos as a fully open-source PyTorch implementation based on public research and speculation about Claude Mythos. The architecture uses a Recurrent Depth Transformer (RDT) that runs a smaller set of layers up to 16 times instead of stacking hundreds of layers with different weights.

How RDT Works

RDT consists of three parts: a prelude that encodes the input once, a recurrent block that loops multiple times, and a coda that produces the output. Each iteration updates the hidden state using the previous state, the original input signal, and the transformer computation. The input is reinjected every loop to prevent the model from drifting away from what it was supposed to process.

════════════════════════════════════════

Claude Mythos Clone Shocks Anthropic and OpenAI

What This Is Actually About

Key Points

Open Mythos Architecture

How RDT Works

tldr

Claude Mythos Clone Shocks Anthropic and OpenAI

What This Is Actually About

Key Points

Open Mythos Architecture

How RDT Works

Efficiency Through Mixture of Experts

Parameter Efficiency

Latent Space Reasoning

Systematic Generalization

Stability Mechanisms

Memory Efficiency

Scaling Hypothesis

Moonshot AI Kimiko 2.6

Kimiko Benchmark Performance

XAI Voice APIs

XAI Performance Claims

If You Remember Nothing Else

Watch Out For