ai-multimodal
Analyze images, audio, and video with the Gemini API (which the author rates as better at vision than Claude). Generate images (Imagen 4) and videos (Veo 3). Use for vision analysis, transcription, OCR, design extraction, and other multimodal AI tasks.
Also installable via the skills CLI:
npx skills add JorgeZuloaga/audio-dev-mcps/mcp-rew/.opencode/skills/ai-multimodal
Source
Path: mcp-rew/.opencode/skills/ai-multimodal/SKILL.md (main)