whisper
OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and language identification. Six model sizes from tiny (39M params) to large (1550M par
Also installable via skills CLI
npx skills add zechenzhangAGI/AI-research-SKILLs/18-multimodal/whisper