ai-multimodal

Process and generate multimedia content using the Google Gemini API for better vision capabilities. Capabilities include analyzing audio files (transcription with timestamps, summarization, speech understan

by Knguyen-data · Repository · other
Also installable via skills CLI
npx skills add Knguyen-data/Right-IELTS-information-system/.claude/skills/ai-multimodal

Source

Path: .claude/skills/ai-multimodal (master)
