ai-multimodal

Process and generate multimedia content using Google Gemini API. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis up t

by agentgptsmith· Repository·other
Also installable via skills CLI
npx skills add agentgptsmith/MonadFramework/.claude/skills/ai-multimodal

Source

Path:.claude/skills/ai-multimodal/SKILL.md(main)

Related in other