ai-multimodal

Process and generate multimedia content using Google Gemini API for better vision capabilities. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understan

by captain-corgi· Repository·other
Also installable via skills CLI
npx skills add captain-corgi/marketplace-website/.claude/skills/ai-multimodal

Source

Path:.claude/skills/ai-multimodal/SKILL.md(main)

Related in other

ai-multimodal | AgentArea Skills