audio-language-models

Patterns for real-time voice, speech-to-text, and text-to-speech (TTS) using the Gemini Live API, Grok Voice Agent, GPT-4o-Transcribe, and AssemblyAI. Use when implementing voice agents, audio transcription, or conversational AI.

by yonatangross · Repository · other
Also installable via the skills CLI:
npx skills add yonatangross/skillforge-claude-plugin/skills/audio-language-models

Source

Path: skills/audio-language-models (main)
