Engineering
gemini-audio
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Installation
Agent type
Claude Code
Install Command (macOS)
curl -fsSL "https://mentalok.io/api/v1/skills/gemini-audio/install?os=mac&agent=claude" | bash
Install Command (Windows)
curl -L "https://mentalok.io/api/v1/skills/gemini-audio/install?os=windows&agent=claude" -o install-gemini-audio.bat && install-gemini-audio.bat
Download Installer
Download Skill Project