ai-multimodal

Process and generate multimedia content using Google Gemini API for better vision capabilities. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understan

by CongDon1207· Repository·other
Also installable via skills CLI
npx skills add CongDon1207/AGENTS.md/.claude/skills/ai-multimodal

Source

Path:.claude/skills/ai-multimodal/SKILL.md(main)

Related in other

ai-multimodal | AgentArea Skills