gemini
CLI for Gemini AI, enabling one-shot model inference, text generation, and JSON-formatted data extraction for OpenClaw users.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
150 skills found
Implement Google Gemini API vision capabilities for image/document analysis including captioning, object detection, segmentation, and multi-image comparison.
Perform advanced video analysis using Google's Gemini API: summarize content, transcribe audio, extract timestamps, clip segments, and analyze YouTube URLs or local files with support for multiple models and long contexts.
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Gemini-powered UI design review, accessibility auditing, and design system validation tool for software agents.
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs with high-context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Generate and edit images using the Gemini API via the nanaban CLI. Create illustrations, logos, and icons, or perform photo edits like background removal and style transfer.
Interface to the Google Gemini Image Generation API for text-to-image generation, editing, style templates, and automated retry workflows.
Claude Code as an architect: delegate all coding and file edits to the Gemini CLI while maintaining control through planning, verification, and oversight.
Expert skill for implementing the Gemini Interactions API. Use for stateful multi-turn chat, background Deep Research agent tasks, function calling, structured outputs, and modern Python/TypeScript SDK integration.
Generate and edit images, diagrams, and infographics using Google's Gemini 3 Pro model. Supports text-to-image, style transformation, and data-accurate visual creation.
Professional Gemini CLI Skill Architect: specialized in scaffolding new skills, converting Claude Code tools to Gemini, and refactoring/optimizing existing CLI orchestrators.