Productivity
ai-multimodal avatar

ai-multimodal

Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.

Installation

Agent type

Claude Code

Install Command (macOS)
curl -fsSL "https://mentalok.io/api/v1/skills/ai-multimodal/install?os=mac&agent=claude" | bash
Install Command (Windows)
curl -L "https://mentalok.io/api/v1/skills/ai-multimodal/install?os=windows&agent=claude" -o install-ai-multimodal.bat && install-ai-multimodal.bat

Download Skill Project

/agent-skill/ai-multimodal