gemini-vision
Implement Google Gemini API vision capabilities for image and document analysis, including captioning, object detection, segmentation, and multi-image comparison.
Discover reusable agent skills, browse implementation details, and find the right skill for your workflow.
Generate absurdly thorough, professional README.md files for any project, covering local development, system architecture, and deployment instructions.
Expert guidance for Logseq plugin development, specifically optimized for the new database architecture, API integration, and property management.
Develop, test, sign, and publish governance plugins for Memoria using Rhai or gRPC runtimes. Manage the full plugin lifecycle from scaffolding to activation.
Guidance for Model Context Protocol (MCP) server development, including tool design, resource handling, and AI/ML integration patterns.
Search and execute dynamic external tools via the QVeris API for real-time data retrieval, stock market analysis, and web-based tasks.
Read the full text content of a specific note from an Obsidian knowledge base or vault.
Generate hierarchical, token-efficient AGENTS.md files for AI coding agents to provide repository-wide context and project-specific guidelines.
Implement Google Gemini API audio capabilities: process, transcribe, and summarize audio files, analyze environmental sounds, and generate natural speech with controllable TTS.
Manage Jenkins CI/CD pipelines via REST API. Trigger builds, monitor job status, view console logs, and manage nodes and queues directly from your terminal or AI agent.
One-click publishing of Markdown articles to WeChat Official Account drafts, featuring automated image hosting, multi-theme styling, and code syntax highlighting.
Manage automatic model routing for Higress AI Gateway via CLI. Configure triggers for intelligent model selection based on request content.