ai-multimodal

Process and generate multimedia content using Google Gemini API for better vision capabilities. Capabilities include analyze audio files (transcription with timestamps, summarization, speech understan

by The1Studio· Repository·other
Also installable via skills CLI
npx skills add The1Studio/theone-training-skills/.claude/skills/ai-multimodal

Source

Path:.claude/skills/ai-multimodal/SKILL.md(main)

Related in other

ai-multimodal | AgentArea Skills