ai-multimodal

Process and generate multimedia content using Google Gemini API for better vision capabilities. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understan

by ngocsangyem· Repository·other
Also installable via skills CLI
npx skills add ngocsangyem/ngocsangyem.dev/.claude/skills/ai-multimodal

Source

Path:.claude/skills/ai-multimodal(main)

Related in other

ai-multimodal | AgentArea Skills