audio-language-models
Gemini Live API, Grok Voice Agent, GPT-4o-Transcribe, AssemblyAI patterns for real-time voice, speech-to-text, and TTS. Use when implementing voice agents, audio transcription, or conversational AI.
Also installable via skills CLI
npx skills add yonatangross/skillforge-claude-plugin/skills/audio-language-models