ai-multimodal

Process and generate multimedia content using Google Gemini API for better vision capabilities. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understan

by danielctc· Repository·other
Also installable via skills CLI
npx skills add danielctc/ReactSpacesMonoRepo/.claude/skills/ai-multimodal

Source

Path:.claude/skills/ai-multimodal/SKILL.md(main)

Related in other

ai-multimodal | AgentArea Skills