gemini-api
121 skills found
Interface to the Google Gemini image generation API for text-to-image, editing, style templates, and automated retry workflows.
Expert skill for implementing the Gemini Interactions API. Use for stateful multi-turn chat, background Deep Research agent tasks, function calling, structured outputs, and modern Python/TypeScript SDK integration.
Command-line interface for Gemini AI, enabling one-shot model inference, text generation, and JSON-formatted data extraction for OpenClaw users.
Process and generate multimedia with Google Gemini. Analyze audio, images, videos, and PDFs using large context windows. Supports transcription, visual QA, OCR, and AI-driven image creation.
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Implement Google Gemini API vision capabilities for image/document analysis including captioning, object detection, segmentation, and multi-image comparison.
Generate and edit images, diagrams, and infographics using Google's Gemini 3 Pro model. Supports text-to-image, style transformation, and data-accurate visual creation.
Claude Code as an architect: delegate all coding and file edits to the Gemini CLI while maintaining control through planning, verification, and oversight.
Generate and edit images using the Gemini API via the nanaban CLI. Create illustrations, logos, and icons, or perform photo edits like background removal and style transfer.
Generate artistic 3D city-themed food diorama images using Google Gemini API. Creates Pop Mart style four-quadrant layouts featuring iconic dishes, cultural symbols, and city-specific heritage elements.
Generate professional visual assets including app icons, logos, banners, and illustrations using the Nano Banana Pro (Gemini 3 Pro) AI model.
Generate publication-quality statistical plots from CSV or JSON data files using AI-driven automated visualization.