Guide for implementing Google Gemini API audio capabilities - analyze audio with transcription, summarization, and understanding (up to 9.5 hours), plus generate speech with controllable TTS. Use when
collection/kienhaminh__speed-reader__claude__skills__gemini-audio__SKILL.md(main)