ai-multimodal

Process and generate multimedia content using Google Gemini API for better vision capabilities. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understan

by binjuhor· Repository·other
Also installable via skills CLI
npx skills add binjuhor/shadcn-lar/.claude/skills/ai-multimodal

Source

Path:.claude/skills/ai-multimodal/SKILL.md(main)

Related in other

ai-multimodal | AgentArea Skills