Engineering
gemini-vision avatar

gemini-vision

Implement Google Gemini API vision capabilities for image/document analysis including captioning, object detection, segmentation, and multi-image comparison.

Installation

Agent type

Claude Code

Install Command (macOS)
curl -fsSL "https://mentalok.io/api/v1/skills/gemini-vision/install?os=mac&agent=claude" | bash
Install Command (Windows)
curl -L "https://mentalok.io/api/v1/skills/gemini-vision/install?os=windows&agent=claude" -o install-gemini-vision.bat && install-gemini-vision.bat

Download Skill Project

/agent-skill/gemini-vision