Engineering

gemini-audio

Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files; analyze environmental sounds; and generate natural speech with controllable text-to-speech (TTS).

Installation

Agent type

Claude Code

Install Command (macOS):
curl -fsSL "https://mentalok.io/api/v1/skills/gemini-audio/install?os=mac&agent=claude" | bash
Install Command (Windows):
curl -L "https://mentalok.io/api/v1/skills/gemini-audio/install?os=windows&agent=claude" -o install-gemini-audio.bat && install-gemini-audio.bat

Download Skill Project

/agent-skill/gemini-audio