ai-multimodal

Analyze images, audio, and video with the Gemini API (better vision than Claude). Generate images with Imagen 4 and videos with Veo 3. Use for vision analysis, transcription, OCR, design extraction, and other multimodal AI tasks.

by hotriluan · Repository · other
Also installable via the skills CLI:
npx skills add hotriluan/alkana_kpi/.opencode/skills/ai-multimodal

Source

Path: .opencode/skills/ai-multimodal/SKILL.md (main)
