LeetDreamer

Generate narrated algorithm visualization videos from JSON specs.

python build.py scenes/two_sum/hashtable/scene.json -v

Audio-First Pipeline

LeetDreamer generates TTS audio first, measures each clip's duration, then drives animation timing from those measurements. Narration and visuals stay perfectly synchronized without manual tweaking.
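
A minimal sketch of that measurement step, using the standard-library wave module (the file paths here are hypothetical, not the project's actual layout):

    import wave

    def clip_duration_seconds(path: str) -> float:
        # Duration of a WAV clip = frame count / sample rate.
        with wave.open(path, "rb") as clip:
            return clip.getnframes() / clip.getframerate()

    # Each animation step starts where the previous clip ends.
    start = 0.0
    timings = []
    for path in ["audio/step_0.wav", "audio/step_1.wav"]:
        duration = clip_duration_seconds(path)
        timings.append({"start": start, "duration": duration})
        start += duration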

Single Source of Truth

One JSON file controls both animation state and narration text. No sync issues.
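
For illustration only, a spec in this style might pair each state snapshot with its narration line (field names here are hypothetical, not the project's actual schema):

    {
      "title": "Two Sum (Hash Table)",
      "steps": [
        {
          "narration": "Store each value's index in a hash table as we scan.",
          "state": { "array": [2, 7, 11, 15], "seen": {}, "pointer": 0 }
        },
        {
          "narration": "At index 1 the complement 2 is already stored, so we return.",
          "state": { "array": [2, 7, 11, 15], "seen": { "2": 0 }, "pointer": 1 }
        }
      ]
    }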

Adapter Architecture

Swap TTS engines, recording backends, and merge strategies without changing scene specs.
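
One way to picture that seam, sketched with typing.Protocol (interfaces and names are illustrative, not LeetDreamer's actual classes):

    from typing import Protocol

    class TTSAdapter(Protocol):
        def synthesize(self, text: str, out_path: str) -> None:
            """Write a WAV clip for one narration step."""

    class Recorder(Protocol):
        def record(self, html_path: str, duration_s: float, out_path: str) -> None:
            """Capture the rendered animation to video."""

    # The pipeline depends only on these protocols, so swapping a TTS
    # engine means supplying another object with the same method; the
    # scene spec never changes.
    def generate_audio(tts: TTSAdapter, narrations: list[str]) -> list[str]:
        paths = [f"audio/step_{i}.wav" for i, _ in enumerate(narrations)]
        for text, path in zip(narrations, paths):
            tts.synthesize(text, path)
        return paths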

Schema Validation

Pydantic validates every scene spec. Catch errors before rendering starts.
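
A sketch of that fail-fast check, assuming Pydantic v2 and the same illustrative field names as above:

    from pathlib import Path
    from pydantic import BaseModel, ValidationError

    class Step(BaseModel):
        narration: str
        state: dict  # free-form per-step animation state

    class SceneSpec(BaseModel):
        title: str
        steps: list[Step]

    try:
        spec = SceneSpec.model_validate_json(Path("scene.json").read_text())
    except ValidationError as err:
        # Surface schema errors before any audio or video work begins.
        raise SystemExit(str(err))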

Jinja2 Templates

HTML templates receive step data and timing. Full control over visualization style.
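
Rendering is then a single template call; the template name and context keys below are hypothetical:

    from pathlib import Path
    from jinja2 import Environment, FileSystemLoader

    env = Environment(loader=FileSystemLoader("templates"), autoescape=True)
    template = env.get_template("hashtable.html.j2")

    # The template receives step data plus the measured audio timings,
    # so animation delays line up with the narration track.
    html = template.render(
        steps=[{"narration": "Scan the array.", "state": {"pointer": 0}}],
        timings=[{"start": 0.0, "duration": 3.2}],
    )
    Path("out/animation.html").write_text(html)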

How It Works

  1. Parse scene spec: load the JSON, validate it with the Pydantic schema
  2. Generate audio: the TTS adapter creates WAV files for each narration step
  3. Extract durations: measure each audio clip to drive timing
  4. Render animation: the Jinja2 template produces a timed HTML animation
  5. Record video: Playwright captures the animation to WebM (see the sketch after this list)
  6. Merge tracks: FFmpeg combines video + audio into the final MP4
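
Steps 5 and 6 in miniature, assuming Playwright's sync API and an ffmpeg binary on PATH (function names are illustrative):

    import subprocess
    from playwright.sync_api import sync_playwright

    def record_webm(html_path: str, total_ms: int, out_dir: str) -> str:
        with sync_playwright() as p:
            browser = p.chromium.launch()
            context = browser.new_context(
                viewport={"width": 1280, "height": 720},
                record_video_dir=out_dir,
                record_video_size={"width": 1280, "height": 720},
            )
            page = context.new_page()
            page.goto(f"file://{html_path}")
            page.wait_for_timeout(total_ms)  # let the timed animation play out
            context.close()  # closing the context flushes the WebM to disk
            video_path = str(page.video.path())
            browser.close()
            return video_path

    def merge_tracks(video: str, audio: str, out: str) -> None:
        # Re-encode the capture and mux in the narration track.
        subprocess.run(
            ["ffmpeg", "-y", "-i", video, "-i", audio,
             "-c:v", "libx264", "-c:a", "aac", "-shortest", out],
            check=True,
        )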

Next Steps

Read the Architecture docs to understand the pipeline internals, or jump to Usage to start building scenes.