model-architecture-nanogpt
Educational GPT implementation in ~300 lines. Reproduces GPT-2 (124M) on OpenWebText. Clean, hackable code for learning transformers. By Andrej Karpathy. Perfect for understanding GPT architecture fro
Also installable via skills CLI
npx skills add davila7/claude-code-templates/data/model-architecture-nanogpt