ai-multimodal

Process and generate multimedia content using the Google Gemini API. Capabilities include analyzing audio files (transcription with timestamps, summarization, speech understanding, music/sound analysis up t

by Microck · Repository · other
Also installable via skills CLI
npx skills add Microck/ordinary-claude-skills/skills_all/ai-multimodal

Source

Path: skills_all/ai-multimodal/SKILL.md (main)
