ai-multimodal

Analyze images/audio/video with Gemini API (better vision than Claude). Generate images (Imagen 4), videos (Veo 3). Use for vision analysis, transcription, OCR, design extraction, multimodal AI.

by NhiLe-Team-Webs· Repository·other
Also installable via skills CLI
npx skills add NhiLe-Team-Webs/nedu/.opencode/skills/ai-multimodal

Source

Path:.opencode/skills/ai-multimodal/SKILL.md(main)

Related in other

ai-multimodal | AgentArea Skills