activation-patching
Causal intervention via activation patching to identify important model components. Use when determining which layers, heads, or positions are causally responsible for model behavior.
Also installable via skills CLI
npx skills add ndif-team/skills/data/activation-patching